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HUMAN OXIDOREDUCTASE PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of oxidoreductase proteins and 
5 to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative disorders 
including cancer; endocrine, metabolic, reproductive, neurological, viral, and autoimmune/inflarnmatory 
disorders, and in the assessment of the effects of exogenous compounds on the expression of nucleic 
acid and amino acid sequences of oxidoreductase proteins. 

10 BACKGROUND OF THE INVENTION 

Many pathways of biogenesis and biodegradation require oxidoreductase (dehydrogenase or 
reductase) activity, coupled to the reduction or oxidation of a donor or acceptor cofactor. Potential 
cofactors include cytochromes, oxygen, disulfide, iron-sulfur proteins, flavin adenine dinucleolide 
(FAD), and the nicotinamide adenine dinucleotides NAD and NAQP (Newsholme, E. A. and Leech, 
15 A.R. (1 983) Biochemistry for the Medical Sciences , John Wiley and Sons, Chichester, U.K. pp. 779- 
793). 

Reductase activity catalyzes the transfer of electrons between substrate(s) and cofactor(s) with 
concurrent oxidation of the cofactor. The reverse dehydrogenase reaction catalyzes the reduction of a 
cofactor and consequent oxidation of the substrate. Oxidoreductase enzymes are a broad superfamily 

20 of protdas that catalyze numerous reactions in all cells of organisms ranging from bacteria to plants to 
humans. These reactioas include metabolism of sugar, certain detoxification reactions in the liver, and 
the synthesis or degradation of fatty acids, amino acids, glucocorticoids, estrogens, androgens, and 
prostaglandins. Different family members are named according to the direction in which their reactions 
are typically catalyzed; thus they may be referred to as oxidoreductases, oxidases, reductases, or 

25 dehydrogenases. In addition, family members often have distinct cellular localizations, including the 
cytosol, the plasma membrane, mitochondrial inner or outer membrane, and peroxisomes. 

Tetrahydrofolate is a derivatized glutamate molecule that acts as a carrier, providing activated 
one-carbon units to a wide variety of biosynthetic reactions, including synthesis of purines, pyrrolidines, 
and the amino acid methionine. Tetrahydrofolate is generated by the activity of a holoenzyme complex 

30 called tetrahydrofolate synthase, which includes three enzyme activities: tetrahydrofolate 
dehydrogenase, tetrahydrofolate cyclohydrolase, and tetrahydrofolate synthetase. Thus, 
tetrahydrofolate dehydrogenase plays an important role in generating building blocks for nucleic and 
amino acids, crucial to proliferating cells. 

Intracellular redox status plays a critical role in the assembly of proteins. A major rate limiting 

35 step in protein folding is the thiolrdisulfide exchange necessary for correct protein assembly. Although 
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incubation of reduced, unfolded proteins in buffers containing defined ratios of oxidized and reduced 
thiols can lead to folding into native conformation, the rate of folding is slow, and the attainment of the 
native conformation decreases proportionately with protein size and the number of cysteine residues. 
Certain cellular compartments such as the endoplasmic reticulum of eukaryotes and the periplasmic 
5 space of prokaryotes are maintained in a more oxidized state than the surrounding cytosol. Correct 
disulfide formation can occur in these compartments, but it occurs at a rate that is insufficient for 
normal cell processes and inadequate for synthesizing secreted proteins. 

Protein disulfide isomerases (PDIs), thioredoxins, and glutaredoxins are able to catalyze the 
formation of disulfide bonds and regulate the redox environment in cells to enable the necessary 
10 thiol:disulfide exchanges. Each of these classes of molecules has a somewhat different function, but all 
belong to a group of disulfide-containing redox proteins that contain a conserved active-site sequence 
and are ubiquitously distributed in eukaryotes and prokaryotes. PDIs are found in the endoplasmic 
reticulum of eukaryotes and in the periplasmic space of prokaryotes. PDIs function by exchanging their 
own disulfide few thiols in a folding peptide chain. In contrast, reduced thioredoxins and glutaredoxins 
15 are generally found in the cytoplasm and function by directly reducing disulfides in the substrate 
proteins. Thioredoxin (Trx), a heat-stable, redox-acti ve protein, contains an active site cysteine 
disulfide/dithiol. Oxidized thioredoxin, Trx-S, can be reduced to the dithiol form by NADPH and a 
specific flavoprotein enzyme, thioredoxin reductase. Reduced thioredoxin, Trx-(SH), participates in a 
number of redox reactions mostly linked to reduction of protein disulfides. Trx and thioredoxin 
reductase (TR), together with NADPH, form a redox complex in which TR catalyzes the electron 
transport from NADPH to Trx. The reduced thioredoxin then functions as an electron donor in a wide 
variety of different metabolic processes. 

Disulfide-containing redox proteins not only facilitate disulfide formation, but also regulate and 
participate in a wide variety of physiological processes. The thioredoxin system serves, for example, as 
a hydrogen donor for ribonucleotide reductase and controls the activity of enzymes by redox reactions. 
Mammalian thioredoxin (MT) acts as a hydrogen donor for ribonucleotide reductase and methionine 
sulfoxide reductase, facilitates refolding of disulfide^ontaining proteins, and activates the 
glucocorticoid and interleukin-2 receptors. MT also modulates the DNA binding activity of some 
transcription factors cither directly (TFIIIC, BZLF1, and NF-kB) or indirectly (AP-1) through the 
nuclear factor Ref-1 . The importance of the redox regulation of transcription factors is exemplified by 
the v-fos oncogene where a point mutation of the thioredoxin-modulated cysteine residue results in 
constitutive activation of the AP-1 complex. Thioredoxin, secreted by cells using a leaderless pathway, 
stimulates the proliferation of lymphoid cells, fibroblasts, and a variety of human solid tumor cell lines. 
Furthermore, thioredoxin is an essential component of early pregnancy factor, inhibits human 
immunodeficiency virus expression in macrophages, reduces H 2 0 2 , scavenges free radicals, and protects 
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cells against oxidative stress (Abate, C.^al. (1990) Science 249: 1157-1161; Rosen, A. el al. (1995) 
Int. Immunol. 7: 625-633; Tagaya, Y. et al. (1989) EMBO J. 8: 757-764; Newman, G. W. (1994) J. 
ExpL Med. 180: 359-363; ami Makino, Y. (1996) J. Clia Invest 98: 2469-2477). 

Short-chain alcohol dehydrogenases (SCADs) are a family of dehydrogenases that share only 
5 1 5% to 30% sequence identity, with similarity predominantly in the coenzyme binding domain and the 
substrate binding domaia In addition to the well-known role in detoxification of ethanol, SCADs are 
also involved in synthesis and degradation of fatty acids, steroids, and some prostaglandins, and arc 
therefore implicated in a variety of disorders such as lipid storage disease, myopathy, SCAD deficiency, 
and certain genetic disorders. For example, retinol dehydrogenase is a SCAD-famfly member (Simon, 
10 A et al. (1995) J. Biol. Chem. 270:1 107-1 112) that converts retinol to retinal, the precursor of retinoic 
acid. Retinoic acid, a regulator of differentiation and apoptosis, has been shown to down-regulate genes 
involved in cell proliferation and inflammation (Chai, X. et al. (1995) J. Biol. Chem. 270:3900-3904). 
In addition, retinol dehydrogenase has been linked to hereditary eye diseases such as autosomal 
recessive childhood-onset severe retinal dystrophy (Simon, A. et al. (1996) Genomics 36:424-430). 

15 Propagation of nerve impulses, modulation of cell proliferation and differentiation, induction of 

the immune response, and tissue homeostasis involve neurotransmitter metabolism (Weiss, B. (1991) 
Neurotoxicology 12:379-386; Collins, S.M. et al. (1992) Ana N.Y. Acad. Sci. 664:415-424; Brown, 
J.K. and Imam, H. (1991) J. Inherit. Metab. Dis. 14:436-458). Many pathways of neurotransmitter 
metabolism require oxidoreductase activity, coupled to reduction or oxidation of a cofactor, such as 

20 NAD7NADH (Newsholme, E. A and Leech, AR. (1983) Biochemistry for the Medical Sciences . John 
Wiley and Sons, Chichester, U.K. pp. 779-793). Degradation of catecholamines (epinephrine or 
norepinephrine) requires alcohol dehydrogenase (in the brain) or aldehyde dehydrogenase (in peripheral 
tissue). NAD* -dependent aldehyde dehydrogenase oxidizes 5-hydroxyindok>3-acetate (the product of 
5-hydroxytryptamine (serotonin) metabolism) in the brain, blood platelets, liver and pulmonary 

25 endothelium (Newsholme, E. A and Leech, AR. supra p. 786). Other neurotransmitter degradation 
pathways that utilize NADTNADH-dependent oxidoreductase activity include those of L-DOPA 
(precursor of dopamine, a neuronal excitatory compound), glycine (an inhibitory neurotransmitter in the 
brain and spinal cord), histamine (liberated from mast cells during die inflammatory response), and 
taurine (an inhibitory neurotransmitter of the brain stem, spinal cord and retina) (Newsholme, E. A. and 

30 Leech, A.R. supra, pp. 790, 792). Epigenetic or genetic defects in neurotransmitter metabolic pathways 
can result in a spectrum of disease states in different tissues including Parkinson's disease and inherited 
myoclonus (McCance, KX. and Huether, S.E. (1994) Pathophysiology . Mosby-Year Book, Inc., St. 
Louis, MO pp. 402-404; Gundlach, AL. (1990) FASEB J. 4:2761-2766). 

3-Hydroxyacyl-CoA dehydrogenase (3HACD) is involved in fatty acid metabolism. It 

35 catalyzes the reduction of 3-hydroxyacyl-CoA to 3-oxoacyl-CoA, with concomitant oxidation of NAD 
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to NADH, in the mitochondria and peroxisomes of eukaryotic cells. In peroxisomes, 3H ACD and 
enoyl-CoA hydratasc form an enzyme complex called bifunctional enzyme, defects in which are 
associated with peroxisomal bifunctional enzyme deficiency. This interruption in fatty acid metabolism 
produces accumulation of very-long chain fatty acids, disrupting development of the brain, bone, and 
5 adrenal glands. Infants born with this deficiency typically die within 6 months (Watkins, P. et al. 
(1989) J. Clin. Invest. 83:771-777; Online Mendelian Inheritance in Man (OMIM), #261515). The 
neurodegeneration that is characteristic of Alzheimer's disease involves development of extracellular 
plaques in certain brain regions. A major protein component of these plaques is the peptide amyloid-0 
(AP), which is one of several cleavage products of amyloid precursor protein (APP). 3HACD, which 

10 has been shown to bind the AP peptide, is overexpressed in neurons affected in Alzheimer's disease. In 
addition, an antibody against 3HACD can block the toxic effects of Ap in a cell culture model of 
Alzheimer's disease (Yan, S. et al. (1997) Nature 389:689-695; OMIM, #602057). 

1 7p-hydroxysteroid dehydrogenase (1 7PHSD6) plays an important role in the regulation of the 
male reproductive hormone, dihydrotestosterone (DHTT). 17pHSD6 acts to reduce levels of DHTT by 

15 oxidizing a precursor of DHTT, 3a-diol, to androsterone which is readily glucuronidated and removed 
from tissues. 17pHSD6 is active with both androgen and estrogen substrates when expressed in 
embryonic kidney 293 cells. At least five other isozymes of 17PHSD have been identified that catalyze 
oxidation and/or reduction reactions in various tissues with preferences for different steroid substrates 
(Biswas, M.G. and Russell, D.W. (1997) J. Biol. Chem. 272:15959-15966). For example, 17pHSDl 

20 preferentially reduces estradiol and is abundant in the ovary and placenta. 1 7PHSD2 catalyzes 
oxidation of androgens and is present in the endometrium and placenta. 17PHSD3 is exclusively a 
reductive enzyme in the testis (Geissler, W.M. et al. (1994) Nature Genet. 7:34-39). An excess of 
androgens such as DHTT can contribute to certain disease states such as benign prostatic hyperplasia 
and prostate cancer. 

25 Steroids, such as estrogen, testosterone, corticosterone, and others, are generated from a 

common precursor, cholesterol, and are interconverted into one another. A wide variety of enzymes act 
upon cholesterol, including a number of dehydrogenases. One such dehydrogenase is 3-oxo-5-a-steroid 
dehydrogenase (OASD), a microsomal membrane protein highly expressed in prostate and other 
androgen-responsive tissues. OASD catalyzes the conversion of testosterone into dihydrotestosterone, 

30 which is the most potent androgen. Dihydrotestosterone is essential for the formation of the male 

phenotype during embryogenesis, as well as for proper androgen-mediated growth of tissues such as the 
prostate and male genitalia. A defect in OASD that prevents the conversion of testosterone into 
dihydrotestosterone leads to a rare form of male pseudohermaphroditis, characterized by defective 
formation of the external genitalia (Andersson, S., et al. (1991) Nature 354:159-161; Labrie, F., et al. 

35 (1992) Endocrinology 131:1571-1573; OMIM #264600). Thus, OASD plays a central role in sexual 
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differentiation and androgen physiology. 

The discovery of new oxidoreductase prolans and the polynucleotides encoding them satisfies a 
need in the art by providing new compositions which are useful in the diagnosis, prevention, and 
treatment of ceil proliferative disorders including cancer; endocrine, metabolic, reproductive, 
neurological, viral, and autoimmune/inflammatory disorders, and in the assessment of the effects of 
exogenous compounds on the expression of nucleic acid and amino acid sequences of oxidoreductase 
proteins. 



SUMMARY OF THE INVENTION 

The invention features purified polypeptides, oxidoreductase proteins, refaTed to collectively as 
"ORP" and individually as "ORP-1," "ORP-2," "ORP-3," "ORP-4," "ORP-5," "ORP-6," "ORP-7," 
"ORP-8," "ORP-9," "ORP-10," "ORP-1 1 » "ORP-12," "ORP-13," "ORP-14," "ORP-15," "ORP- 
16," "ORP-17," "ORP-18/' "ORP-19," "ORP-20," "ORP-21," "ORP-22," "ORP-23," "ORP-24," 
"ORP-25," "ORP-26," "ORP-27." In one aspect, the invention provides an isolated polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l-27. In one alternative, the invention provides an isolated 
polypeptide comprising the amino acid sequence of SEQ ID NO: 1-27. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising an 
amino acid sequence selected from the group consisting of a) an amino acid sequence selected from the 
group consisting of SEQ ID NO:l-27, b) a naturally occurring amino acid sequence having at least 
90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
27, c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27. In one alternative, the polynucleotide encodes a polypeptide selected 
from the group consisting of SEQ ID NO:l-27. In another alternative, the polynucleotide is selected 
from the group consisting of SEQ ID NO:28-54. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group coasisting of SEQ ID NO: 1-27, c) 
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a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l -27. In one alternative, the invention provides a eel! transformed with the 
recombinant polynucleotide. In another alternative, the invention provides a transgenic organism 
comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, c) 
a biologically active fragment of an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27. The method comprises a) culturing a cell under conditioas suitable for 
expression of the polypeptide, wherein said cell is transformed with a recombinant polynucleotide 
comprising a promoter sequence operably linked to a polynucleotide encoding the polypeptide, and b) 
recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occuuing amino acid 
sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1 -27, and d) an immunogenic fragment of an amino acid 
sequence selected from the group consisting of SEQ ID NO:l-27. 

The invention further provides an isolated polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 contiguous 
nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group coasisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
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an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a probe comprising 
at least 20 contiguous nucleotides comprising a sequence complementary to said target polynucleotide 
in the sample, and which probe specifically hybridizes to said target polynucleotide, under conditions 
whereby a hybridization complex is formed between said probe and said target polynucleotide or 
fragments thereof, and b) (Meeting the presence or absence of said hybridization complex, and 
optionally, if present, the amount thereof. In one alternative, the probe comprises at least 60 contiguous 
nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, said 
target polynucleotide having a sequence of a polynucleotide comprising a polynucleoti de sequence 
selected from the group consisting of a) a polynucleotide sequence selected from the group consisting of 
SEQ ID NO:28-54, b) a naturally occurring polynucleotide sequence having at least 70% sequence 
identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, c) a 
polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary to b), and e) 
an RNA equivalent of a)-d). The method comprises a) amplifying said target polynucleotide or 
fragment thereof using polymerase chain reaction amplification, and b) detecting the presence or 
absence of said amplified target polynucleotide or fragment thereof, and, optionally, if present, the 
amount thereof. 

The invention further provides a composition comprising an effective amount of a polypeptide 
comprising an amino acid sequence selected from the group consisting of a) an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino acid sequence 
having at least 90% sequence identity to an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-27, and a pharmaceutical^ acceptable excipient In one 
embodiment, the composition comprises an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-27. The invention additionally provides a method of treating a disease or condition 
associated with decreased expression of junctional ORP, comprising administering to a patient in need 
of such treatment the compositioa 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 -27, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -27, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27. The method comprises a) 
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exposing a sample comprising the polypeptide to a compound, and b) detecting agonist activity in the 
sample. In one alternative, the invention provides a composition comprising an agonist compound 
identified by the method and a pharmaceutical^ acceptable excipienl. In another alternative, the 
invention provides a method of treating a disease or condition associated with decreased expression of 
5 functional ORP, comprising administering to a patient in need of such treatment the composition. 

Additionally, the invention provides a method for screening a compound for effectiveness as 
an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

10 selected from the group consisting of SEQ ID NO: 1 -27, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic fragment 
of an amino acid sequence selected from the group consisting of SEQ ID NO: 1 -27. The method 
comprises a) exposing a sample comprising the polypeptide to a compound, and b) delecting 
antagonist activity in the sample. In one alternative, the invention provides a composition comprising 

15 an antagonist compound identified by the method and a pharmaceutical^ acceptable excipient In 
another alternative, the invention provides a method of treating a disease or condition associated with 
overexpression of functional ORP, comprising administering to a patient in need of such treatment the 
composition. 

The invention further provides a method of screening for a compound that specifically binds 
20 to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO:l -27, and d) an immunogenic fragment of an amino acid 
25 sequence selected from the group coasisting of SEQ ID NO: 1 -27. The method comprises a) combining 
the polypeptide with at least one test compound under suitable conditions, and b) detecting binding 
of the polypeptide to the test compound, thereby identifying a compound that specifically binds to the 
polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
30 activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO:l-27, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-27, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-27, and d) an immunogenic 
35 fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27. The 
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method comprises a) combining the polypeptide with at least one test compound under conditions 
permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 
presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
5 change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

The invention further provides a method for screening a compound for effectiveness in 
altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:28-54, the method comprising a) 
10 exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

The invention further provides a method for assessing toxicity of a test compound, said 
method comprising a) treating a biological sample containing nucleic acids with the test compound; 
b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at least 20 
15 contiguous nucleotides of a polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID 
NO:28-54, ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to 
a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Hybridization occurs under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence selected from the group 
consisting of i) a polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 
ii) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, iii) a 
polynucleotide sequence complementary to i), iv) a polynucleotide sequence complementary to ii), 
and v) an RNA equivalent of i)-iv). Alternatively, the target polynucleotide comprises a fragment of 
a polynucleotide sequence selected from the group consisting of i)- v) above; c) quantifying the 
amount of hybridization complex; and d) comparing the amount of hybridization complex in the 
treated biological sample with the amount of hybridization complex in an untreated biological 
sample, wherein a difference in the amount of hybridization complex in the treated biological sample 
is indicative of toxicity of the test compound. 



BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
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clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding ORP. 

Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 
sequences, and methods, algorithms, and searchable databases used for analysis of ORP. 
5 Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific expression 

patterns of each nucleic acid sequence as determined by northern analysis; diseases, disorders, or 
conditions associated with these tissues; and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding ORP were isolated. 
10 Table 5 shows the t<x>!s. programs, and algorithms used to analyze the polynucleotides and 

polypeptides of the invention, along with applicable descriptions, references, and threshold parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
15 that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
20 and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the art, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same meanings 
25 as commonly understood by one of ordinary skill in the art to which this invention belongs. Although 
any machines, materials, and methods similar or equivalent to those described herein can be used to 
practice or test the present invention, the preferred machines, materials and methods are now described. 
All publications mentioned herein are cited for the purpose of describing and disclosing the cell lines, 
protocols, reagents and vectors which are reported in the publications and which might be used in 
30 connection with the invention. Nothing herein is to be construed as an admission that the invention is 
not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"ORP" refers to the amino acid sequences of substantially purified ORP obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
35 human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

10 
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The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
ORP. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of ORP either by directly interacting with ORP 
or by acting on components of the biological pathway in which ORP participates. 

An "allelic variant" is an alternative form of the gene encoding ORP. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or mere times in 
a given sequence. 

"Altered" nucleic acid sequences encoding ORP include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as ORP or a 
polypeptide with at least one functional characteristic of ORP. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe of 
the polynucleotide encoding ORP, and improper or unexpected hybridization to allelic variants, with a 
locus other than the normal chromosomal locus for the polynucleotide sequence encoding ORP. The 
encoded protein may also be "altered," and may contain deletions, insertions, or substitutions of amino 
acid residues which produce a silent change and result in a functionally equivalent ORP. Deliberate 
amino acid substitutions may be made on the basis of similarity in polarity, charge, solubility, 
hydrophobic^, hydrophilicily, and/or the amphipathic nature of the residues, as long as the biological 
or immunological activity of ORP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, and positively charged amino acids may include lysine and 
arginine. Amino acids with uncharged polar side chains having similar hydrophilicity values may 
include: asparagine and glutamine; and serine and threonine. Amino acids with uncharged side chains 
having similar hydrophilicity values may include: leucine, isoleucine, and valine; glycine and alanine; 
and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid sequence 
to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PGR) technologies well known 
in the art 
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The tenn "antagonist" refers to a molecule which inhibits or attenuates the biological activity of 
ORP. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of ORP either by 
directly interacting with ORP or by acting on components of the biological pathway in which ORP 
5 participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments thereof, 
such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind ORP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
10 to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, or 
synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and keyhole 
limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 

15 makes contact with a particular antibody. When a protein or a fragment of a protein is used to 

immunize a host animal, numerous regions of the protein may induce the production of antibodies which 
bind specifically to antigenic determinants (particular regions or three-dimensional structures on the 
protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen used to 
elicit the immune response) for binding to an antibody. 

20 The term "antisense" refers to any composition capable of base-pairing with the "sense" 

(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2-methoxyethyl sugars or Z-methoxyethoxy sugars; or oligonucleotides having 

25 modified bases such as 5 -methyl cytosine, 2'-deoxyuracil, or 7-deaza-2'-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translatioa The designation "negative" or "minus" can refer to the antisense strand, and the 

30 designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic ORP, or of any oligopeptide thereof, to 
induce a specific immune response in appropriate animals or cells and to bind with specific antibodies. 

35 "Complementary" describes the relationship between two single-stranded nucleic acid 
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sequences that anneal by base-pairing. For example, 5'-AGT-3' pairs with its complement, 
3'-TCA-5'. 

A "composition comprising a given polynucleotide sequence" and a "composition comprising a 
given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding ORP or fragments of ORP may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 
deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (c.g., Denhardt's solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (Applied Biosystems, 
Foster City CA) in the 5' and/or the 3' direction, and resequenced, or which has been assembled from 
one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WT) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that arc predicted to least 
interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows amino 
acids which may be substituted for an original amino acid in a protean and which are regarded as 
conservative amino acid substitutions. 



Original Residue 



Conservative Substitution 



Ala 
Arg 



Gly, Ser 
His, Lys 



Asn 

Asp 

Cys 

Gin 

Glu 

Gly 

His 

He 

Leu 

Lys 

Met 

Phe 

Ser 

Thr 



Cys,Thr 
Ser.Val 
Phe,Tyr 



Arg, Gin, Glu 
Leu, He 

His, Met, Leu, Trp, Tyr 



Leu, Val 
Ile,Val 



Asn, Glu, His 
Asp, Gin, His 
Ala 

Asn, Arg, Gin, Glu 



Asn, Glu 
Ala, Sea- 



Asp, Gin, His 



Trp 
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Tyr His, Phe, Trp 
Val He, Leu, Thr _ 

Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 
backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydrophobicity of the molecule at the site of the substitution, and/or (c) the bulk of the 
side chain. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. Chemical 
modifications of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, hydroxyl, or amino group. A derivative polynucleotide encodes a polypeptide which retains 
at least one biological or immunological function of the natural molecule. A derivative polypeptide is 
one modified by glycosylation, pegylation, or any similar process that retains at least one biological or 
immunological ftinction of the polypeptide from which it was derived. 

A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 
measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of ORP or the polynucleotide encoding ORP which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 
fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 
15, 16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 
from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID N 0:28-54 comprises a region of unique polynucleotide sequence that 
specifically identifies SEQ ID NO:28-54, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:28-54 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish * 
SEQ ID NO:28-54 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:28-54 and the region of SEQ ID NO:28-54 to which the fragment corresponds are routinely 
determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

14 
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A fragment of SEQ ID NO: 1-27 is encoded by a fragment of SEQ ID NO:28-54. A fragment 
of SEQ ID NO: 1-27 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO:l-27. For example, a fragment of SEQ ID NO: 1 -27 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1 -27. The precise length of 
5 a fragment of SEQ ID NO: 1 -27 and the region of SEQ ID NO: 1 -27 to which the fragment 
corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation codon 
(e.g., methionine) followed by an open reading frame and a translation termination codon. A "full- 
10 length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 

"Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 
or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer to 
the percentage of residue matches between at least two polynucleotide sequences aligned using a 
15 standardized algorithm Such an algorithm may insert, in a standardized and reproducible way, gaps in 
the sequences being compared in order to optimize alignment between two sequences, and therefore 
achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3. 1 2e sequence 
alignment program This program is part of the LASERGENE software package, a suite of molecular 
biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in Higgins, D.G. 
and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 8:189-191. 
For pairwise alignments of polynucleotide sequences, the default parameters are set as follows: 
Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue weight table is 
selected as the default. Percent identity is reported by CLUSTAL V as the "percent similarity" between 
aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms is 
provided by the National Center for Biotechnology Iriformation (NCBI) Basic Local Alignment Search 
Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403^10), which is available from several 
sources, including the NCBI, Bethesda, MD, and on the Internet at 

http://www.ncbi.rtmnih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively al http^/www.ncbi.iilmnihgov/goriybl2.html. The 

15 



WO 01/44448 PCT/US00/33158 

"BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 
programs are commonly used with gap and other parameters set to default sellings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 

Matrix: BLOSUM62 

Reward for match: 1 

Penalty for mismatch: -2 

Open Gap: 5 and Extension Gap: 2 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 11 

Filter: on 

Percent identity may be measured over the length of an entire deflned sequence, for example, as 
defined by a particular SEQ ID number, or may be measured ova* a shorter length, for example, over 
the length of a fragment taken from a larger, defined sequence, for instance, a fragmeni of at least 20, at 
least 30, at least 40, at least 50, at least 70, at least 1 00, or at least 200 contiguous nucleotides. Such 
lengths are exemplary only, and il is understood that any fragment length supported by the sequences 
shown herein, in the tables, figures, or Sequence Listing, may be used to describe a length over which 
percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes in 
a nucleic acid sequence can be mack using this degeneracy to produce multiple nucleic acid sequences 
that all encode substantially the same protein. 

The phrases "percent identity" and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some alignment 
methods take into account conservative amino acid substitutions. Such conservative substitutions, 
explained in more detail above, generally preserve the charge and hydrophobicity at the site of 
substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Pa-cent identity between polypeptide sequences may be determined using the default parameters 
of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3. 1 2e sequence alignment 
program (described and referenced above). For pairwise alignments of polypeptide sequences using 
CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap penalty=3, window=5, and 
"diagonals saved"=5. The PAM250 matrix is selected as the default residue weight table. As with 
polynucleotide alignments, the percent identity is reported by CLUSTAL V as the "percent similarity" 
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between aligned polypeptide sequence pairs. 

Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 
comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 2.0. 12 
(Apr-21 -2000) with blastp set at default parameters. Such default parameters may be, for example: 
5 Matrix: BLOSUM62 

Open Cap: 11 and Extension Cap: 1 penalties 

Gap x drop-off: 50 

Expect: 10 

Word Size: 3 
10 Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, for 
example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for instance, 
a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 150 
contiguous residues. Such lengths are exemplary only, and it is understood that any fragment length 
supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be used to 
describe a length over which percentage identity may be measured 

"Human artificial chromosomes" (HACs) are linear miaochromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the stringency 
of the hybridization process, with more stringent conditions allowing less non-specific binding, i.e., 
binding between pairs of nucleic acid strands that are not perfectly matched. Permissive conditions for 
annealing of nucleic acid sequences are routinely determinable by one of ordinary skill in the art and 
may be consistent among hybridization experiments, whereas wash conditions may be varied among 
experiments to achieve the desired stringency, and therefore hybridization specificity. Permissive 
annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 1 % (w/v) 
SDS, and about 100 ug/ml sheared, denatured salmon sperm DNA. 
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Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5T to 20°C lower than the thermal melting point (T ln ) for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of the 
5 target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and conditions 
for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 1989, Molecular 
Cloning: A Laboratory Manual. 2 nd ed, vol. 1-3, Cold Spring Harbor Press, Plainview NY; specifically 
see volume 2, chapter 9. 

High stringency conditions for hybridization between polynucleotides of the present invention 

10 include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0. 1 % SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0. 1 to 2 x SSC, with SDS being present at about 0. 1 %. Typically, blocking 
reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 jig/ml. Organic solvent, such as 

15 formamide at a concentration of about 35-50% v/v, may also be used undo- particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

20 The term "hybridization complex" refers to a complex formed between two nucleic acid 

sequences by virtue of the formation of hydrogen bonds between complementary bases. A hybridization 
complex may be formed in solution (e.g., C 0 t or R^t analysis) or formed between one nucleic acid 
sequence present in solution and another nucleic acid sequence immobilized on a solid support (e.g., 
paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which cells 

25 or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide sequence 
resulting in the addition of one or more amino acid residues or nucleotides, respectively. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression of 

30 various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect cellular 
and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment of ORP whifch is 
capable of eliciting an immune response when introduced into a living organism, for example, a 
mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment of 

35 ORP which is useful in any of the antibody production methods disclosed herein or known in the art. 
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The term 4t microarray" refers to an arrangement of a plurality of polynucleotides, polypeptides, 
or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of ORP. For example, modulation may 
cause an increase or a decrease in protein activity, binding characteristics, or any other biological, 
functional, or immunological properties of ORP. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide, 
polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
Junctional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and, where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. PNAs 
preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, and 
may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an ORP may involve lipidation, glycosylation, 
phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in the 
art. These processes may occur synthetically or biochemically. Biochemical modifications will vary by 
cell type depending on the enzymatic milieu of ORP. 

"Probe" refers to nucleic acid sequences encoding ORP, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. Typical 
labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. "Primers" are 
short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target polynucleotide by 
complementary base-pairing. The prima- may then be extended along the target DNA strand by a DNA 
polymerase enzyme. Primer pairs can be used for amplification (and identification) of a nucleic acid 
sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
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be employed, such as probes and primers that comprise at least 20, 25, 30, 40, 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers may 
be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 
5 Methods for preparing and using probes and primers are described in the references, for 

example Sambrook, J. et al. (1989) Molecular Cloning: A Laboratory Manual . 2 nd ed, vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubei, F.M. et al. (1987) Current Protocols in Molecular 
Biology. Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al. (1990) PCR 
Protocols. A Guide to Methods and Applications . Academic Press, San Diego CA. PCR primer pairs 
10 can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 
MA). 

Oligonucleotides for use as primers are selected using software known in the art lor such 
purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 

15 1 00 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 5,000 
nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer selection 
programs have incorporated additional features for expanded capabilities. For example, the PrimOU 
primer selection program (available to the public from the Genome Center at University of Texas South 
West Medical Cento", Dallas TX) is capable of choosing specific primers from megabase sequences 

20 and is thus useful for designing primers on a genome-wide scope. The Primer3 primer selection 
program (available to the public from the Whitehead Institute/MIT Cento- for Genome Research, 
Cambridge MA) allows the user to input a "misprinting library," in which sequences to avoid as primer 
binding sites are user-specified. Primer3 is useful, in particular, for the selection of oligonucleotides for 
microarrays. (The source code for the latter two primer selection programs may also be obtained from 

25 their respective sources and modified to meet the user's specific needs.) The PrimeGen program 
(available to the public from the UK Human Genome Mapping Project Resource Centre, Cambridge 
UK) designs primers based on multiple sequence alignments, thereby allowing selection of primers that 
hybridize to either the most conserved or least conserved regions of aligned nucleic acid sequences. 
Hence, this program is useful for identification of both unique and conserved oligonucleotides and 

30 polynucleotide fragments. The oligonucleotides and polynucleotide fragments identified by any of the 
above selection methods are useful in hybridization technologies, for example, as PCR or sequencing 
primers, microarray dements, or specific probes to identify fully or partially complementary 
polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection are not limited to 
those described above. 

35 A "recombinant nucleic acid" is a sequence that is not naturally occurring cm- has a sequence 

20 
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that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g„ by genetic engineering techniques 
such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
5 been altered solely by addition, substitution, or deletion of a portion of the nucleic acid Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter sequence. 
Such a recombinant nucleic acid may be part of a vector that is used, for example, to transform a cell. 

Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 
vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 
10 expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5* and 3* untranslated regions (UTRs). 
Regulatory elements interact with host or viral proteins which control transcription, translation, or RNA 
stability. 

15 "Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 

amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; ami 
other moieties known in the art. 

An "RNA equivalent," in reference to a DN A sequence, is composed of the same linear 
20 sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding ORP, or fragments thereof, or ORP itself, may comprise a bodily iluid; an extract from 
25 a cell, chromosome, organelle, or membrane isolated from a cell; a cdl; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular structure 
30 of the protein, e.g., the antigenic determinant or epitope, recognized by the binding molecule. For 

example, if an antibody is specific for epitope "A," the presence of a polypeptide comprising the epitope 
A, or the presence of free unlabeled A, in a reaction containing free labeled A and the antibody will 
reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
35 removed from their natural environment and are isolated or separated, and are at least 60% free, 
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preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides by 
different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell type 
or tissue under given conditions at a given time. 

"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods well 
known in the art, and may rely on any known method for the insertion of foreign nucleic acid sequences 
into a prokaryotic or eukaryotic host cell. The method for transformation is sdected based on the type 
of host ceil bang transformed and may include, but is not limited to, bacteriophage or viral infection, 
eiectroporation, heat shock, lipofection, and particle bombardment. The term "transformed" cells 
includes stably transformed cells in which the inserted DNA is capable of replication either as an 
autonomously replicating plasmid or as part of the host chromosome, as well as transiently transformed 
cells which express the inserted DNA or RNA for limited periods of time. 

A "transgenic organism," as used herein, is any organism, including but not limited to 
animals and plants, in which one or moie of the cells of the organism contains heterologous nucleic 
acid introduced by way of human intervention, such as by transgenic techniques well known in the 
art. The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor 
of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with 
a recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
■Vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 
transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook, J. etal. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having at 
least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of the 
nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
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set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or greater 
sequence identity ova- a certain defined length. A variant may be described as, for example, an "allelic" 
(as defined above), "splice," "species," or "polymorphic" variant. A splice variant may have significant 
5 identity to a reference molecule, but will generally have a greater or lesser number of polynucleotides 
due to alternative splicing of exons during mRNA processing. The corresponding polypeptide may 
possess additional functional domains or lack domains that are present in the reference molecule. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A polymorphic 
10 variant is a variation in the polynucleotide sequence of a particular gene between individuals of a given 
species. Polymorphic variants also may encompass "single nucleotide polymorphisms" (SNPs) in 
which the polynucleotide sequence varies by one nucleotide base. The presence of SNPs may be 
indicative of, for example, a certain population, a disease state, or a propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having at 

15 least 40% sequence identity to the particular polypeptide sequence over a certain length of one of the 
polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07-1999) 
set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at least 
60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

20 THE INVENTION 

The invention is based on the discovery of new human oxidoreductase proteins (ORP), the 
polynucleotides encoding ORP, and the use of these compositions for the diagnosis, treatment, or 
prevention of cell proliferative disorders including cancer; endocrine, metabolic, reproductive, 
neurological, viral, and autoimmune/inflammatory disorders. 

25 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding ORP. 

Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide and 
nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which nucleic 
acids encoding each ORP were identified, and column 4 shows the cDNA libraries from which these 
clones were isolated Column 5 shows Incyte clones and their corresponding cDNA libraries. Clones 

30 for which cDNA libraries arc not indicated were derived from pooled cDNA libraries. In some cases, 
GenBank sequence identifiers are also shown in column 5. The Incyte clones and GenBank cDNA 
sequences, where indicated, in column 5 were used to assemble the consensus nucleotide sequence of 
each ORP and are useful as fragments in hybridization technologies. 

The columns of Table 2 show various properties of each of the polypeptides of the invention: 

35 column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
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polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylate 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical methods 
and in some cases, searchable databases to which the analytical methods were applied. The methods of 
5 column 7 were used to characterize each polypeptide through sequence homology and protein motifs. 
The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding ORP. The first column of Table 3 lists the nucleotide 
SEQIDNOs. Column 2 lists fragments of the nucleotide sequences of column 1. These fragments are 
useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:28-54 and 

10 to distinguish between SEQ ID NO:28-54 and related polynucleotide sequences. The polypeptides 
encoded by these fragments are useful, for example, as immunogenic peptides. Column 3 lists tissue 
categories which express ORP as a fraction of total tissues expressing ORP. Column 4 lists diseases, 
disorders, or conditions associated with those tissues expressing ORP as a fraction of total tissues 
expressing ORP. Column 5 lists the vectors used to subclone each cDNA library. 

15 The columns of Tabic 4 show descriptions of the tissues used to construct the cDNA libraries 

from which cDNA clones encoding ORP were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDN A libraries in column 2. 

SEQ ID NO:31 maps to chromosome 3 within the interval from 23.2 to 31.4 centiMorgans. 

20 SEQ ID NO:42 maps to chromosome 22 within the interval from 0 to 40.2 centiMorgans. SEQ ID 
NO:48 maps to chromosome 7 within the interval from 100.5 to 1 14.5 centiMorgans, to chromosome 
17 within the interval from 67.6 to 69.3 centiMorgans, and to chromosome 17 within the interval 
from 83.8 centiMorgans to the q terminus. SEQ ID NO:53 maps to chromosome 1 1 within the 
interval from 64.9 to 70.9 centiMorgans. 

25 The invention also encompasses ORP variants. A preferred ORP variant is one which has at 

least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid sequence 
identity to the ORP amino acid sequence, and which contains at least one functional or structural 
characteristic of ORP. 

The invention also encompasses polynucleotides which encode ORP. In a particular 

30 embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected from 
the group consisting of SEQ ID NO:28-54, which encodes ORP. The polynucleotide sequences of SEQ 
ID NO:28-54, as presented in the Sequence Listing, embrace the equivalent RN A sequences, wherein 
occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone is 
composed of ribose instead of deoxyribose. 

35 The invention also encompasses a variant of a polynucleotide sequence encoding ORP. In 
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particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at least 
about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide sequence 
encoding ORP. A particular aspect of the invention encompasses a variant of a polynucleotide 
sequence comprising a sequence selected from the group consisting of SEQ ID NO:28-54 which has at 
5 least about 70%, or alternatively at least about 85%, or even at least about 95% polynucleotide 
sequence identity to a nucleic acid sequence selected from the group consisting of SEQ ID NO.28-54. 
Any one of the polynucleotide variants described above can encode an amino acid sequence which 
contains at least one functional or structural characteristic of ORP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the genetic 
1 0 code, a multitude of polynucleotide sequences encoding ORP, some bearing minimal similarity to the 
polynucleotide sequences of any known and naturally occurring gene, may be produced. Thus, the 
invention contemplates each and every possible variation of polynucleotide sequence that could be made 
by selecting combinations based on possible codon choices. These combinations arc made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence of naturally 

15 occurring ORP, and all such variations are to be considered as being specifically disclosed. 

Although nucleotide sequences which encode ORP and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring ORP under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding ORP or its 
derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally occurring 

20 codons. Codons may be selected to increase the rale at which expression of the peptide occurs in a 

particular prokaryotic or eukaryotic host in accordance with the frequency with which particular codons 
are utilized by the host. Other reasons for substantially altering the nucleotide sequence encoding ORP 
and its derivatives without altering the encoded amino acid sequences include the production of RNA 
traascripts having more desirable properties, such as a greater half-life, than transcripts produced from 

25 the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode ORP and ORP 
derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the synthetic 
sequence may be inserted into any of the many available expression vectors and cell systems using 
reagents well known in the art. Moreover, synthetic chemistry may be used to introduce mutations into 

30 a sequence encoding ORP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:28-54 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 

35 1 52:507-5 1 1 .) Hybridization conditions, including annealing and wash conditions, are described in 
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"Definitioas." 

Methods for DNA sequencing are well known in the art and may be used to practice any of the 
embodiments of the inventioa The methods may employ such enzymes as the Klenow fragment of 
DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (Applied 
Biosystems, Foster City C A), thermostable T7 polymerase (Amersham Pharmacia Biotech, Piscataway 
NJ), or combinatioas of polymerases and proofreading exonucleases such as those found in the 
ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, sequence 
preparation is automated with machines such as the MICROLAB 2200 liquid transfer system 
(Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI CATALYST 
800 thermal cycler (Applied Biosystems). Sequencing is then carried out using either the ABI 373 or 
377 DNA sequencing system (Applied Biosystems), the MEGABACE 1000 DNA sequencing system 
(Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting sequences are 
analyzed using a variety of algorithms which are wdl known in the art. (See, e.g., Ausubel, F.M. 
0997) Short Protoc ols in Molecular Biology . John Wiley & Sons, New York NY, unit 7.7; Meyers, 
R.A. (1995) Molecular Biology and Biotechnology . Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding ORP may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 
restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, c.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising a 
known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 
Res. 16:8186.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 
are known in the art (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 
Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
Alto C A) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using \ 
commercially available software, such as OLIGO 4.06 Prima* Analysis software (National Biosciences, 
Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in length, to have a 
GC content of about 50% or more, and to anneal to the template at temperatures of about 68°C to 
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72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) library 
does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence into 5' 
non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze the 
size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 
specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, Applied Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 
which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode ORP may be cloned in recombinant DNA molecules that direct expression of ORP, or fragments 
or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy of the 
genetic code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be produced and used to express ORP. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter ORP-encoding sequences for a variety of purposes including, but not 
limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 
oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleolide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction sites, 
alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECUL ARB REEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 
5,837,458; Chang, C.-C et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C et al. (1999) Nat 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
improve the biological properties of ORP, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DNA shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
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properties. These preferred variants may then be pooled and farther subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
5 optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
homologous genes in the same gene family, either from the same or different species, thereby 
maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding ORP may be synthesized, in whole or in part, 

10 using chemical methods well known in the art (See, e.g., Caruthers, M.H. et ah (1980) Nucleic Acids 
Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
ORP itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins. Structures and Molecular Properties. WH Freeman, New York NY, pp. 

15 55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 
using the ABI 43 1 A peptide synthesizer (Applied Biosystems). Additionally, the amino acid sequence 
of ORP, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

20 The peptide may be substantially purified by preparative high performance liquid 

chromatography. (See, e.g., Chiez, R.M and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active ORP, the nucleotide sequences encoding ORP or 

25 derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contaias 
the necessary elements for transcriptional and translational control of the inserted coding sequence in a 
suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3* untranslated regions in the vector and in polynucleotide sequences 
encoding ORP. Such elements may vary in their strength and specificity. Specific initiation signals 

30 may also be used to achieve more efficient translation of sequences encoding ORP. Such signals 
include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding ORP and its initiation codon and upstream regulatory sequences are inserted into 
the appropriate expression vector, no additional transcriptional or translational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 

35 translational control signals including an in-frame ATG initiation codon should be provided by the 
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vector. Exogenous translational elements and initiation codons may be of various origins, both natural 
and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers appropriate 
for the particular host cell system used (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding ORP and appropriate transcriptional and translational control 
elements. These methods include in vitro recombinant DN A techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A Laboratory 
Manual, Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Mole cular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding ORP. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. 
Acad. Sci. USA 91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, 
N. (1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et d. 
(1984) Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The 
McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 
191-196; Logan, J. and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81 :3655-3659; and Harrington, 
J.J. et al. (1997) Nat Genet 15:345-355.) Expression vectors derived from retroviruses, 
adenoviruses, or herpes or vaccinia viruses, or from various bacterial plasmids, may be used for 
delivery of nucleotide sequences to the targeted organ, tissue, or cell population. (See, e.g., Di 
Nicola, M. et al. (1998) Cancer Gen. Ther. 5(6):350-356; Yu, M. et al. (1993) Proc. Natl. Acad. Sci. 
USA 90(13):6340-6344; Buller, R.M. et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. 
(1994) Mol. Immunol. 31(3):219-226; and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) 
The invention is not limited by the host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding ORP. For example, routine cloning, 
subcloning, and propagation of polynucleotide sequences encoding ORP can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla C A) or PSPORT1 plasmid 
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(Life Technologies). Ligation of sequences encoding ORP into the vector's multiple cloning site 
disrupts the lacL gene, allowing a colorimetric screening procedure for identification of transformed 
bacteria containing recombinant molecules. In addition, these vectors may be useful for in vitro 
transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of nested 
5 deletions in the cloned sequence. (See, e.g., Van Heekc, G. and S.M. Schuster (1989) J. Biol. Chem. 
264:5503-5509.) When large quantities of ORP are needed, e.g. for the production of antibodies, 
vectors which direct high level expression of ORP may be used For example, vectors containing the 
strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of ORP. A number of vectors containing 
constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH promoters, may be 
used in the yeast Saccharomvc es cerevisiae or Pichia nastoris . In addition, such vectors direct either the 
secretion or intracellular mention of expressed proteins and enable integration of foreign sequences into 
the host genome for stable propagation. (See, e.g., Ausubel, 1995, supra : Bitter, supra : and Scorer, 
supra .) 

Plant systems may also be used for expression of ORP. Transcription of sequences encoding 
ORP may be driven viral promoters, eg., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 
Alternatively, plant promotes such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, supra : Broglie, supra : and Winter, supra .) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated traasfection. (See, e.g., 
The McGraw Hill Yearbook of Science and Technology (1 992) McGraw Hill, New York NY, pp. 
191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 
where an adenovirus is used as an expression vector, sequences encoding ORP may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses ORP in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. Natl. 
Acad Sci. USA 81 :3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma virus 
(RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EB V-based 
vectors may also be used for high-level prolan expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA Uian can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 
constructed and delivered via conventional delivery methods (liposomes, polycationic amino polymers, 
or vesicles) for therapeutic purposes. (See, e.g., Harrington, J J. et al. (1997) Nat. Genet. 15:345-355.) 
For long term production of recombinant proldas in mammalian systems, stable expression of 
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ORP in cell lines is preferred For example, sequences encoding ORP can be transformed into cell lines 
using expression vectors which may contain viral origins of replication and/or endogenous expression 
dements and a selectable marker gene on the same or on a separate vector. Following the introduction 
of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media before being 
switched to selective media. The purpose of the selectable marker is to confer resistance to a selective 
agent, and its presence allows growth and recovery of cells which successfully express the introduced 
sequences. Resistant clones of stably transformed cells may be propagated using tissue culture 
techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These include, 
but are not limited to, the herpes simplex vims thymidine kinase and adenine phosphoribosyltransferase 
genes, for use in tk and apr ceils, respectively. (See, e.g., Wigler, M. et at. (1977) Cell 1 1:223-232; 
Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selcctioa For example, dhfr confers resistance to methotrexate; neo confers 
resistance to the aminoglycosides neomycin and G-418; and als and pat confer resistance to 
15 chlorsulfuron and phosphinotricin acetyltransferase, respectively. (See, e.g., Wigler, M. et al. (1980) 
Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbcre-Garapin, F. et al. (1981) J. Mol. Biol. 150:1-14.) 
Additional selectable genes have been described, e.g., trpB and hisD, which alter cellular requirements 
for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 
85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B 
glucuronidase and its substrate B-glucuronide, or luciferase and its substrate luciferin may be used. 
These markers can be used not only to identify transformants, but also to quantify the amount of 
transient or stable protein expression attributable to a specific vector system. (See, e.g., Rhodes, C. A. 
(1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed For example, if the 
sequence encoding ORP is inserted within a marker gene sequence, transformed cells containing 
sequences encoding ORP can be identified by the absence of marker gene functioa Alternatively, a 
marker gene can be placed in tandem with a sequence encoding ORP under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding ORP and that express 
ORP may be identified by a variety of procedures known to those of skill in the art. These procedures 
include, but are not limited to, DNA-DNA or DNA-RNA hybridizatioas, PCR amplification, and 
protein bioassay or immunoassay techniques which include membrane, solution, or chip based 
technologies for the detection and/or quantification of nucleic acid or protein sequences. 
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Immunological methods for detecting and measuring the expression of ORP using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques include 
enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and fluorescence 
activated ceil sorting (FACS). A two-site, monoclonal-based immunoassay utilizing monoclonal 
antibodies reactive to two non-interfering epitopes on ORP is preferred, but a competitive binding assay 
may be employed. These and other assays are well known in the art (See, e.g., Hampton, R. et al. 
(1990) Serological Methods, a Laboratory Manual . APS Press, St. Paul MN, Sect IV; Coligan, J.E. et 
al. (1997) Current Protocols in Immunology , Greene Pub. Associates and Wiley-Interscience, New 
York NY; and Pound, J.D. (1998) Immunochemical Protocols . Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled hybridization 
or PCR probes for detecting sequences related to polynucleotides encoding ORP include oligolabeling, 
nick translation, end-labeling, or PCR amplification using a labeled nucleotide. Alternatively, the 
sequences encoding ORP, or any fragments thereof, may be cloned into a vector for the production of 
an mRNA probe. Such vectors are known in the art, arc commercially available, and may be used to 
synthesize RNA probes in vitro by addition of an appropriate RNA polymerase such as T7, T3, or SP6 
and labeled nucleotides. These procedures may be conducted using a variety of commercially available 
kits, such as those provided by Amersham Pharmacia Biotech, Promega (Madison WI), and US 
Biochemical. Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, 
cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding ORP may be cultured under 
conditions suitable for the expression and recovery of the protein from cell culture. The protein 
produced by a transformed cell may be secreted cm- retained intracellular^ depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors containing 
polynucleotides which encode ORP may be designed to contain signal sequences which direct secretion 
of ORP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashioa Such modifications of the 
polypeptide include, but are not limited to, aeetylation, carboxylation, glycosylate, phosphorylation, 
lipidation, and acylation. Post-translational processing which cleaves a "prepro" or "pro" form of the 
protein may also be used to specify protein targeting, folding, and/or activity. Different ho&t cells 
which have specific cellular machinery and characteristic mechanisms for post-translational activities 
(e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the American Type Culture 
Collection (ATCC, Manassas VA) and may be chosen to ensure the correct modification and processing 
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of the foreign protda 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding ORP may be ligated to a heterologous sequence resulting in translation of a fusion 
protein in any of the aforementioned host systems. For example, a chimeric ORP protein containing a 
heterologous moiety that can be recognized by a commercially available antibody may facilitate the 
screening of peptide libraries for inhibitors of ORP activity. Heterologous protein and peptide moieties 
may also facilitate purification of fusion proteins using commercially available affinity matrices. Such 
moieties include, but are not limited to. glutathione S-transferase (GST), maltose binding protein 
(MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin 
(HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate fusion proteins on 
immobilized glutathione, maltose, phcnylarsine oxide, calmodulin, and metal-chelate resins, 
respectively. FLAG, c-myc, and hemagglutinin (HA) enable inuminoaffinity purification of fusion 
proteins using commercially available monoclonal and polyclonal antibodies that specifically recognize 
these epitope tags. A fusion protein may also be engineered to contain a proteolytic cleavage site 
15 located between the ORP encoding sequence and the heterologous protein sequence, so that ORP may 
be cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995, supra, ch. 10). A variety of commercially 
available kits may also be used to facilitate expression and purification of fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled ORP may be achieved in 
20 vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These systems 
couple transcription and translation of protein-coding sequences operably associated with the 17, T3, or 
SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid precursor, for 
example, 35 S-methionine. 

ORP of the present invention or fragments thereof may be used to screen for compounds that 
specifically bind to ORP. At least one and up to a plurality of test compounds may be screened for 
specific binding to ORP. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 

In one embodiment, the compound thus identified is closely related to the natural ligand of 
ORP, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, e.g., Coligan, J£. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which ORP 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express ORP, either as a secreted 
protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosonhila. or E. 
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coli. Cells expressing ORP or cell membrane fractions which contain ORP are then contacted with a 
test compound and binding, stimulation, or inhibition of activity of either ORP or the compound is 
analyzed. 

An assay may simply test binding of a test compound to the polypeptide, wherein binding is 
5 detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label For example, 
the assay may comprise the steps of combining at least one test compound with ORP, either in 
solution or affixed to a solid support, and detecting the binding of ORP to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
labeled competitor. Additionally, the assay may be carried out using cell-free preparatioas, chemical 
10 libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

ORP of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of ORP. Such compounds may include agonists, antagonists, or partial or 
inverse agonists. In one embodiment, an assay is performed under conditions permissive for ORP 

15 activity, wherein ORP is combined with at least one test compound, and the activity of ORP in the 
presence of a test compound is compared with the activity of ORP in the absence of the test 
compound. A change in the activity of ORP in the presence of the test compound is indicative of a 
compound that modulates the activity of ORP. Alternatively, a test compound is combined with an in 
vitro or cell-free system comprising ORP under conditions suitable for ORP activity, and the assay is 

20 pa-formed. In either of these assays, a test compound which modulates the activity of ORP may do 
so indirectly and need not come in direct contact with the test compound. At least one and up to a 
plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding ORP or their mammalian homologs may 
be "knocked out" in an animal model system using homologous recombination in embryonic stem 

25 (ES) cells. Such techniques arc well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and ILS. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 1 29/SvJ cell line, are derived from the early mouse ; 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 
interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, MR. 

30 (1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 
Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

35 blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
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to pseudopregnant dams, and Ihe resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding ORP may also be manipulated in vitro in ES cells derived from 
human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282: 1 145-1 147). 

Polynucleotides encoding ORP can also be used to create "knockin" humanized animals 
(pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 
region of a polynucleotide encoding ORP is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulac are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 
Alternatively, a mammal inbred to overexpress ORP, e.g., by secreting ORP in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) Biotechnol. Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of ORP and oxidoreductasc proteins. In addition, the expression of ORP is closely 
associated with cancerous, inflamed, reproductive, and gastrointestinal tissues. Therefore, ORP 
appears to play a role in cell proliferative disorders including cancer; endocrine, metabolic, 
reproductive, neurological, viral, and autoimmune/inflammatory disorders. In the treatment of 
disorders associated with increased ORP expression or activity, it is desirable to decrease the 
expression or activity of ORP. In the treatment of disorders associated with decreased ORP 
expression or activity, it is desirable to increase the expression or activity of ORP. 

Therefore, in one embodiment, ORP or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of ORP. 
Examples of such disorders include, but are not limited to, a cell proliferative disorder, such as actinic 
keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue 
disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia; a cancer, such as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, 
sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 
marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 
muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, 
thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and pituitary 
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resulting from lesions such as primary brain tumors, adenomas, infarction associated with pregnancy, 
hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, immunological 
disorders, and complications due to head trauma, disorders associated with hypopituitarism including 
hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, Hand-Schuller-Christian 
5 disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and dwarfism, disorders associated 
with hyperpituitarism including acromegaly, giantism, and syndrome of inappropriate antidiuretic 
hormone (ADH) secretion (SI ADH) often caused by benign adenoma, disorders associated with 
hypothyroidism including goiter, myxedema, acute thyroiditis associated with bacterial infection, 
subacute thyroiditis associated with viral infection, autoimmune thyroiditis (Hashimoto's disease), 
10 and cretinism, disorders associated with hyperthyroidism including thyrotoxicosis and its various 
forms, Grave's disease, prclibial myxedema, toxic multinodular goiter, thyroid carcinoma, and 
Plummer's disease, disorders associated with hyperparathyroidism including Conn disease (chronic 
hypercalemia), pancreatic disorders such as Type I or Type II diabetes mellitus and associated 
complications, disorders associated with the adrenals such as hyperplasia, carcinoma, or adenoma of 

15 the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, Cushing's 
disease, Liddle's syndrome, and Arnold-Hcaly-Gordon syndrome, pheochromocytoma tumors, and 
Addison's disease, disorders associated with gonadal steroid hormones such as: in women, abnormal 
prolactin production, infertility, endometriosis, perturbations of the menstrual cycle, polycystic 
ovarian disease, hypeiprolactinemia, isolated gonadotropin deficiency, amenorrhea, galactorrhea, 

20 hermaphroditism, hirsutism and virilization, breast cancer, and, in postmenopausal women, 

osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell aplasia, 
hypergonadal disorders associated with Leydig cell tumors, androgen resistance associated with 
absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia; a metabolic disorder, 
such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirrhosis, galactosemia, goiter, 

25 hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, hypercholesterolemia, 
hyperthyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid myopathies, obesity, 
lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, pseudovitamin D-defidency 
rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a reproductive disorder, such as 
disorders of prolactin production, infertility, including tubal disease, ovulatory defects, and 

30 endometriosis, disruptions of the estrous cycle, disruptions of the menstrual cycle, polycystic ovary 
syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian tumors, uterine fibroids, 
autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the breast, fibrocystic breast 
disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm physiology, cancer of the 
testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, 

35 carcinoma of the male breast, and gynecomastia; a neurological disorder, such as epilepsy, ischemic 
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cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, amyotrophic 
lateral sclerosis and other motor neuron disorders, progressive neural muscular atrophy, retinitis 
pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating diseases, bacterial and viral 
5 meningitis, brain abscess, subdural empyema, epidural abscess, suppurative intracranial 

thrombophlebitis, myelitis and radiculitis, viral central nervous system disease, prion diseases 
including kuru, Creutzfeldt- Jakob disease, and Gerstmann-Straussler-Scheinker syndrome, fatal 
familial insomnia, nutritional and metabolic diseases of the nervous system, neurofibromatosis, 
tuberous sclerosis, cerebelloretinal hcmangioblastomatosis, encephalotrigeminal syndrome, mental 

10 retardation and other developmental disorders of the central nervous system, cerebral palsy, 

neuroskeletal disorders, autonomic nervous system disorders, cranial nerve disorders, spinal cord 
diseases, muscular dystrophy and other neuromuscular disorders, peripheral nervous system disorders, 
dermatomyositis and polymyositis, inherited, metabolic, endocrine, and toxic myopathies, myasthenia 
gravis, periodic paralysis, mental disorders including mood, anxiety, and schizophrenic disorders, 

15 seasonal affective disorder (SAD), akathesia, amnesia, catatonia, diabetic neuropathy, tardive 
dyskinesia, dystonias, paranoid psychoses, postherpetic neuralgia, and Tourette's disorder; an 
autoimmune/inflammatory disorder, such as acquired immunodeficiency syndrome (AIDS), 
Addison's disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, 
anemia, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 

20 autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, 
cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes 
mellitus, emphysema, episodic lymphopenia with lymphocylotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' 
disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 

25 myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, pancreatitis, 
polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, Sjogren's syndrome, 
systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, thrombocytopenic purpura, 
ulcerative colitis, uveitis, Werner syndrome, complications of cancer, hemodialysis, and 
extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and helminthic infections, and 

30 trauma; and a viral disorder, such as viral infections, e.g., those caused by adenoviruses (acute 
respiratory disease, pneumonia), arenaviruses (lymphocytic choriomeningitis), bunyaviruses 
(Hantavirus), coronaviruses (pneumonia, chronic bronchitis), hepadnaviruses (hepatitis), 
herpesviruses (herpes simplex vims, varicella-zoster virus, Epstein-Barr virus, cytomegalovirus), 
flaviviruses (yellow fever), orthomyxoviruses (influenza), papillomaviruses (cancer), 

35 paramyxoviruses (measles, mumps), picornoviruses (rhinovirus, poliovirus, coxsackie-virus), 
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polyoma viruses (BK virus, JC virus), poxviruses (smallpox), reovirus (Colorado tick fever), 
retroviruses (human immunodeficiency virus, human T lymphotropic virus), rhabdoviruses (rabies), 
rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). 

In another embodiment, a vector capable of expressing ORP or a fragment or derivative thereof 
5 may be administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of ORP including, but not limited to, those described above. 

In a further embodiment, a composition comprising a substantially purified ORP in conjunction 
with a suitable pharmaceutical carrier may be administered to a subject to treat or prevent a disorder 
associated with decreased expression or activity of ORP including, but not limited to, those provided 
10 above. 

In still another embodiment, an agonist which modulates the activity of ORP may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or activity 
of ORP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of ORP may be administered to a subject to treat or 

15 prevent a disorder associated with increased expression or activity of ORP, Examples of such 

disorders include, but are not limited to, those cell proliferative disorders including cancer; endocrine, 
metabolic, reproductive, neurological, viral, and autoimmune/inflammatory disorders described above. 
In one aspect, an antibody which specifically binds ORP may be used directly as an antagonist or 
indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells or tissues 

20 which express ORP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding ORP may be administered to a subject to treat or prevent a disorder associated with increased 
expression or activity of ORP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 

25 sequences, or vectors of the invention may be administered in combination with other appropriate 

therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made by 
one of ordinary skill in the art, according to conventional pharmaceutical principles. The combination 
of therapeutic agents may act synergistically to effect the treatment or prevention of the various 
disorders described above. Using this approach, one may be able to achieve therapeutic efficacy with 

30 lower dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of ORP may be produced using methods which are generally known in the art. 
In particular, purified ORP may be used to produce antibodies or to screen libraries of pharmaceutical 
agents to identify those which specifically bind ORP. Antibodies to ORP may also be generated using 
methods that arc well known in the art. Such antibodies may include, but are not limited to, polyclonal, 

35 monoclonal, chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 
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expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are generally 
preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with ORP or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral gels 
such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium narvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to ORP 
have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of at 
least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments arc 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of ORP amino 
acids may be ftised with those of another protein, such as KLH, and antibodies to the chimeric molecule 
may be produced 

Monoclonal antibodies to ORP may be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These include, but are not limited 
to, the hybridoma technique, the human B-ccll hybridoma technique, and the EBV-hybridoma 
technique. (See; e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, RJ. et at (1983) Proc. Nad. Acad. Sci. USA 80:2026-2030; and 
Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81 :6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
S. et at (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce ORP-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be generated 
by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., Burton, D.R. 
(1991) Proc. Naa Acad Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte population 
or by screening immunoglobulin libraries or panels of highly specific binding reagents as disclosed in 
the literature. (See, e.g„ Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 86:3833-3837; Winter, 
G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for ORP may also be generated. For 
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example, such fragments include, but are not limited to, Ffab^ fragments produced by pepsin digestion 
of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of the ¥(^2 
fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and easy 
identification of monoclonal Fab fragments with the desired specificity. (Sec, e.g., Huse, W.D. et al. 
5 (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 
specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between ORP and its specific 
10 antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering ORP epitopes is generally used, but a competitive binding assay may also be employed 
(Pound, sunra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay techniques 
may be used to assess the affinity of antibodies for ORP. Affinity is expressed as an association 
1 5 constant, which is defined as the molar concentration of ORP-antibody complex divided by the 

molar concentratioas of free antigen and free antibody under equilibrium conditions. The determined 
for a preparation of polyclonal antibodies, which are heterogeneous in their affinities for multiple ORP 
epitopes, represents the average affinity, or avidity, of the antibodies for ORP. The K,, determined for a 
preparation of monoclonal antibodies, which are monospecific for a particular ORP epitope, represents 
a true measure of affinity. High-affinity antibody preparations with ranging from about 10 9 to 10 12 
L/mole are preferred for use in immunoassays in which the ORP-antibody complex must withstand 
rigorous manipulations. Low-affinity antibody preparations with K, ranging from about 10 6 to 10 7 
L/mole are preferred for use in immunopurification and similar procedures which ultimately require 
dissociation of ORP, preferably in active form, from the antibody (Catty, D. (1988) Antibodies, 
Volume I: A Practical Approach. IRL Press, Washington DC; Liddell, J.E. and A. Cryer (1991) A 
Practical Guide to Monoclonal Antibodies . John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to determine 
the quality and suitability of such preparations for certain downstream applications. For example, a 
polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 5-10 mg 
specific antibody/ml, is generally employed in procedures requiring precipitation of ORP-antibody 
complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines for 
antibody quality and usage in various applications, are generally available. (See, e.g., Catty, supra , and 
Coligan et al., supra .) 

In another embodiment of the invention, the polynucleotides encoding ORP, or any fragment or 
complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
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expression can be achieved by designing complementary sequences or antisense molecules (DNA, RNA, 
PNA, or modified oligonucleotides) to the coding or regulatory regioas of the gene encoding ORP. 
Such technology is well known in the art, and antisense oligonucleotides or larger fragments can be 
designed from various locations along the coding or control regions of sequences encoding ORP. (See, 
5 e.g., Agrawal, S., ed. (1996) Antisense Therapeutics. Humana Press Inc., Totawa NJ.) 

In therapeutic use, any gene delivery system suitable for introduction of the antisense 
sequences into appropriate target cells can be used. Antisense sequences can be delivered 
intracellular^ in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
10 Slater, JB. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9(1 3): 1 288- 1 296.) Antisense sequences can also be introduced intracellular^ through the use of viral 
vectors, such as retrovirus and adeno- associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubd. suEra; Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
15 systems known in the art. (See, e.g., Rossi, J.J. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1):1308-1315; and Morris, M.C. et al. (1997) Nucleic Acids Res. 
25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding ORP may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-X1 disease characterized by X-linked 
inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 
immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Huia Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 
R.G. (1995) Science 270:404-410; Verma, I.M. and N. Somia (1997) Nature 389:239-242)), (ii) 
express a conditionally lethal gene product (eg., in the case of cancers which result from unregulated 
ceil proliferation), or (iii) express a protein which affords protection against intracellular parasites (e.g., 
against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; PoescWa, E. et al. (1996) Proc. Natl. Acad Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruztt . In the 
case where a genetic deficiency in ORP expression or regulation causes disease, the expression of ORP 
from an appropriate population of transduced cells may alleviate the clinical manifestations caused by 
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the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in ORP 
are treated by constructing mammalian expression vectors encoding ORP and introducing these vectors 
by mechanical means into ORP-deficient ceils. Mechanical transfer technologies for use with cells in 
5 vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) ballistic gold particle 
delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene transfer, and (v) the use of 
DNA transposons (Morgan, R.A and W.R Anderson (1993) Annu. Rev. Biochem. 62:191-217; Ivies, 
Z. (1997) Cell 91:501-510; Boulay, J-L. and H. R&ipon (1998) Curr. Opin. Biotechnol. 9:445-450). 
Expression vectors that may be effective for the expression of ORP include, but are not limited 
10 to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad C A), 
PCMV-SCRIPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, 
PTET-ON, PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto C A). ORP may be expressed 
using (i) a constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus 
(RS V), S V40 virus, thymidine kinase (TK), or (3-actin genes), (ii) an inducible promoter (e.g. t the 
15 tetracyclinc-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 

89:5547-5551; Gossen, M. et al. (1995) Science 268.1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:451-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PIND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding ORP from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and AJ. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1 982) EMBO J. 1 :841 -845). The introduction of DNA to primary cells requires modification of these 
standardized mammalian transfection protocols. 

In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to ORP expression are treated by coastructing a retrovirus vector consisting of (i) the 
polynucleotide encoding ORP under the control of an independent promoter or the retrovirus long 
terminal repeal (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 
element (RRE) along with additional retrovirus cw-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (c.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
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Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in an 
appropriate vector producing ceil line (VPCL) that expresses an envelope gene with a tropism for 
receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano. D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61 :1639-1646; Adam, M.A. and 
5 AD. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471 ; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") discloses a 
method for obtaining retrovirus packaging cell lines and is hereby incorporated by reference. 
Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), and the 
10 return of transduced cells to a patient are procedures well known to persons skilled in the art of gene 
therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71 :7020-7029; Bauer, G. et 
al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71 :4707-4716; Ranga, U. et al. 
(1998) Proc. Nad. Acad. Sci. USA 95:1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
15 polynucleotides encoding ORP to cells which have one or more genetic abnormalities with respect to the 
expression of ORP. The construction and packaging of adenovirus-based vectors are well known to 
those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be versatile 
for importing genes encoding immunoregulatory proteins into intact islets in the pancreas (Csete, M.E. 
et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are described in U.S. 
20 Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), hereby incorporated 
by reference. For adenoviral vectors, see also Antinozzi, P.A. et al. (1999) Annu. Rev. Nutr. 
19:51 1-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both incorporated by 
reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 

> polynucleotides encoding ORP to target cells which have one or more genetic abnormalities with respect 
to the expression of ORP. The use of herpes simplex virus (HS V)-based vectors may be especially 
valuable for introducing ORP to cells of the central nervous system, for which HS V has a tropism. The 
construction and packaging of herpes-based vectors are well known to those with ordinary skill in the 
art A replication-competent herpes simplex virus (HSV) type 1-based vector has been used to deliver a 

► reporter gene to the eyes of primates (Liu, X. et al. (1 999) Exp. Eye Res. 1 69:385-395). The 
construction of a HSV-1 virus vector has also been disclosed in detail in U.S. Patent Number 5,804,413 
to DeLuca ("Herpes simplex virus strains for gene transfer"), which is hereby incorporated by 
reference. U.S. Patent Number 5,804,41 3 teaches the use of recombinant HSV d92 which consists of a 
genome containing at least one exogenous gene to be transferred to a cell under the control of the 
appropriate promoter for purposes including human gene therapy. Also taught by this patent are the 
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construction and use of recombinant HSV strains deleted for ICP4, ICP27 and ICP22. For HS V 
vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:5 19-532 and Xu, H. et al. (1994) Dev. Biol. 
163:152-161, hereby incorporated by reference. The manipulation of cloned herpesvirus sequences, the 
generation of recombinant virus following the transfection of multiple plasmids containing different 
5 segments of the large herpesvirus genomes, the growth and propagation of herpesvirus, and the 
infection of cells with herpesvirus are techniques well known to those of ordinary skill in the art 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
deliver polynucleotides encoding ORP to target cells. The biology of the prototypic alphavirus, Semliki 
Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based on the SFV 
10 genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. BiotechnoL 9:464-469). During alphavirus RNA 
replication, a subgenomic RNA is generated that normally encodes the viral capstd proteins. This 
subgenomic RNA replicates to higher levels than the full-length genomic RNA, resulting in the 
overproduction of capsid proteins relative to the viral proteins with enzymatic activity (e.g., protease 
and polymerase). Similarly, inserting the coding sequence for ORP into the alphavirus genome in place 
of the capsid-coding region results in the production of a large number of ORP-coding RN As and the 
synthesis of high levels of ORP in vector transduced cells. While alphavirus infection is typically 
associated with cell lysis within a few days, the ability to establish a persistent infection in hamster 
normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that the lytic replication 
of alphaviruses can be altered to suit the needs of the gene therapy application (Dryga, S. A. ct al. 
(1997) Virology 228:74-83). The wide host range of alphaviruses will allow the introduction of ORP 
into a variety of cell types. The specific transduction of a subset of cells in a population may require 
the seating of cells prior to transductioa The methods of manipulating infectious cDNA clones of 
alphaviruses, performing alphavirus cDNA and RNA transfections, and performing alphavirus 
infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, eg., between about positions -10 
and +10 from the start site, may also be employed to inhibit gene expressioa Similarly, inhibition can 
be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunol ogic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA by 
preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
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molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding ORP. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
5 scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
10 oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared by 
any method known in the art for the synthesis of nucleic acid molecules. These include techniques for 
chemically synthesizing oligonucleotides such as solid phase phosphoramiditc chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA sequences 
15 encoding ORP. Such DNA sequences may be incorporated into a wide variety of vectors with suitable 
RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that synthesize 
complementary RNA, constitutively or inducibly, can be introduced into cell lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends 
20 of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PN As and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, and 
wybutosine, as well as acetyl-, methyl-, thir>, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 
25 An additional embodiment of the invention encompasses a method for screening for a 

compound which is effective in altering expression of a polynucleotide encoding ORP. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
30 chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased ORP 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 
encoding ORP may be therapeutically useful, and in the treament of disorders associated with 
35 decreased ORP expression or activity, a compound which specifically promotes expression of the 
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polynucleotide encoding ORP may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 
commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 
polynucleotide encoding ORP is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding ORP are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 
polynucleotide encoding ORP. The amount of hybridization may be quantified, thus forming the 
15 basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 
can be carried out, for example, using a Schizosaccharomvces pomhe gene expression system (Atkins, 
20 D. et al. (1999) U.S. Patent No. 5,932,435; Arndt, CM. et al. (2000) Nucleic Acids Res. 28:E15) or a 
human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. Commun. 
268:8-13). A particular embodiment of the present invention involves screening a combinatorial 
library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide nucleic acids, and 
modified oligonucleotides) for antisensc activity against a specific polynucleotide sequence (Bruice, 
25 T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable for 
use in vivo, in vitro, and ex vivo. For ex vivo therapy, vectors may be introduced into stem cells taken 
from the patient and clonally propagated for autologous transplant back into that same patient 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
30 using methods which are well known in the art. (Sec, e.g., Goldman, C.K. et al. (1997) Nat 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of such 
therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
monkeys. 

35 An additional embodiment of the invention relates to the administration of a composition which 
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genially comprises an active ingredient formulated with a pharmaceutical^ acceptable extipient 
Excipients may include, for example, sugars, starches, celluloses, gums, and proteins. Various 
formulations are commonly known and are thoroughly discussed in the latest edition of Remineton's 
Pharmaceutical Sciences (Maack Publishing, Easton PA). Such compositions may consist of ORP, 
antibodies to ORP, and mimetics, agonists, antagonists, or inhibitors of ORP. 

The compositions utilized in this invention may be administered by any number of routes 
including, but not limited to, oral, intravenous, intramuscular, intra arterial, intramedullary, intrathecal, 
intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, intranasal, enteral, topical, 
sublingual, or rectal means. 

Compositioas for pulmonary administration may be prepared in liquid or dry powder form. 
These compositions are generally aerosolized immediately prior to inhalation by the patient. In the case 
of small molecules (e.g. traditional low molecular weight organic drugs), aerosol delivery of fast-acting 
formulations is well-known in the art In the case of macromolecules (e.g. larger peptides and proteins), 
recent developments in the field of pulmonary delivery via the alveolar region of the lung have enabled 
the practical delivery of drugs such as insulin to blood circulation (see, c.g„ Patton, J.S. et al., U.S. 
Patent No. 5,997,848). Pulmonary delivery has the advantage of administration without needle 
injection, and obviates the need for potentially toxic penetration enhancers. 

Compositions suitable for use in the invention include compositions wherein the active 
ingredients are contained in an effective amount to achieve the intended purpose. The determination of 
an effective dose is well within the capability of those skilled in the art. 

Specialized forms of compositioas may be prepared far direct intracellular delivery of 
macromolecules comprising ORP or fragments thereof. For example, liposome preparations containing 
a cell-impermeable macromolecule may promote cell fusion and intracellular delivery of the 
macromolecule. Alternatively, ORP or a fragment thereof may be joined to a short cationic N-terminal 
portion from the HIV Tat-1 protein. Fusion proteins thus generated have been found to transduce into 
the cells of all tissues, including the brain, in a mouse model system (Schwarze, S.R. ei al. (1999) 
Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
culture assays, e.g., of neoplastic cells, or in animal models such as mice, rats, rabbits, (togs, monkeys, 
or pigs. An animal model may also be used to determine the appropriate concentration range and route 
of administration. Such information can then be used to determine useful doses and routes for 
administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example ORP or 
fragments thereof, antibodies of ORP, and agonists, antagonists or inhibitors of ORP, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
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standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED^ (the dose therapeutically effective in 50% of the population) or LD^ (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LD^/ED^ ratio. Compositions which exhibit large 
5 therapeutic indices are preferred. The data obtained from cell culture assays and animal studies are 
used to formulate a range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentratioas that includes the ED 50 with little or no toxicity. 
The dosage varies within this range depending upon the dosage form employed, the sensitivity of the 
patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors rdated to the subject 
requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the active 
moiety or to maintain the desired cliect. Factors which may be taken into account include the severity 
of the disease state, the general health of the subject, the age, weight, and gender of the subject, time 
and frequency of administration, drug combination(s), reaction seasitivities, and response to therapy. 
Long-acting compositions may be administered every 3 to 4 days, every week, or biweekly depending 
on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 g to 1 00,000 g, up to a total dose of 
about 1 gram, depending upon the route of administratioa Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 
conditions, locations, etc. 
DIAGNOSTICS 

in another embodiment, antibodies which specifically bind ORP may be used for the diagnosis 
of disorders characterized by expression of ORP, or in assays to monitor patients being treated with 
ORP or agonists, antagonists, or inhibitors of ORP. Antibodies usefiil for diagnostic purposes may be 
prepared in the same manner as described above for therapeutics. Diagnostic assays for ORP include 
methods which utilize the antibody and a label to detect ORP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be labeled by 
covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter molecules, 
several of which are described above, are known in the art and may be used 

A variety of protocols for measuring ORP, including ELISAs, RIAs, and FACS, afe known in 
the art and provide a basis for diagnosing altered or abnormal levels of ORP expression. Normal or 
standard values for ORP expression are established by combining body fluids or cell extracts taken 
from normal mammalian subjects, for example, human subjects, with antibody to ORP under conditioas 
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suitable for complex formation. The amount of standard complex formation may be quantitated by 
various methods, such as photometric means. Quantities of ORP expressed in subject, control, and 
disease samples from biopsied tissues are compared with the standard values. Deviation between 
standard and subject values establishes the parameters for diagnosing disease. 
5 In another embodiment of the invention, the polynucleotides encoding ORP may be used for 

diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 
quantify gene expression in biopsied tissues in which expression of ORP may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
10 ORP, and to monitor regulation of ORP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding ORP or closely related molecules may be used to 
identify nucleic acid sequences which encode ORP. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
15 conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding ORP, allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and may have at least 50% 
sequence identity to any of the ORP encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:28-54 or from 
20 genomic sequences including promoters, enhancers, and introns of the ORP gene. 

Means for producing specific hybridization probes for DNAs encoding ORP include the cloning 
of polynucleotide sequences encoding ORP or ORP derivatives into vectors for the production of 
mRNA probes. Such vectors are known in the art, are commercially available, and may be used to 
synthesize RNA probes in vitro by means of the addition of the appropriate RNA polymerases and the 
25 appropriate labeled nucleotides. Hybridization probes may be labeled by a variety of reporter groups, 
for example, by radionuclides such as 32 Por 35 S, or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding ORP may be used for the diagnosis of disorders associated 
with expression of ORP. Examples of such disorders include, but are not limited to, a cell proliferative 
30 disorder, such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed 
connective tissue disease, myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, 
psoriasis, primary thrombocythemia; a cancer, such as adenocarcinoma, leukemia, lymphoma, 
melanoma, myeloma, sarcoma, leratocarcinoma, and, in particular, cancers of the adrenal gland, 
bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, 
35 kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, 
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testis, thymus, thyroid, and uterus; an endocrine disorder, such as disorders of the hypothalamus and 
pituitary resulting from lesions such as primary brain tumors, adenomas, infarction associated with 
pregnancy, hypophysectomy, aneurysms, vascular malformations, thrombosis, infections, 
immunological disorders, and complications due to head trauma, disorders associated with 
5 hypopituitarism including hypogonadism, Sheehan syndrome, diabetes insipidus, Kallman's disease, 
Hand-Schuller-Christian disease, Letterer-Siwe disease, sarcoidosis, empty sella syndrome, and 
dwarfism, disorders associated with hyperpituitarism including acromegaly, giantism, and syndrome 
of inappropriate antidiuretic hormone (ADH) secretion (SIADH) often caused by benign adenoma, 
disorders associated with hypothyroidism including goiter, myxedema, acute thyroiditis associated 
10 with bacterial infection, subacute thyroiditis associated with viral infection, autoimmune thyroiditis 
(Hashimoto's disease), and cretinism, disorders associated with hyperthyroidism including 
thyrotoxicosis and its various forms, Grave's disease, pretibial myxedema, toxic multinodular goiter, 
thyroid carcinoma, and Plummer's disease, disorders associated with hyperparathyroidism including 
Conn disease (chronic hypercalemia), pancreatic disorders such as Type I or Type II diabetes mcllitus 
15 and associated complications, disorders associated with the adrenals such as hyperplasia, carcinoma, 
or adenoma of the adrenal cortex, hypertension associated with alkalosis, amyloidosis, hypokalemia, 
Cushing's disease, Liddle's syndrome, and Amold-Healy-Gordon syndrome, pheochromocytoma 
tumors, and Addison's disease, disorders associated with gonadal steroid hormones such as: in 
women, abnormal prolactin production, infertility, endometriosis, perturbations of the menstrual 
20 cycle, polycystic ovarian disease, hyperprolactinemia, isolated gonadotropin deficiency, amenorrhea, 
galactorrhea, hermaphroditism, hirsutism and virilization, breast cancer, and, in postmenopausal 
women, osteoporosis, and, in men, Leydig cell deficiency, male climacteric phase, and germinal cell 
aplasia, hypcrgonadal disorders associated with Leydig cell tumors, androgen resistance associated 
wilh absence of androgen receptors, syndrome of 5 a-reductase, and gynecomastia; a metabolic 
25 disorder, such as Addison's disease, cystic fibrosis, diabetes, fatty hepatocirThosis, galactosemia, 
goiter, hyperadrenalism, hypoadrenalism, hyperparathyroidism, hypoparathyroidism, 
hypercholesterolemia, hyperthyroidism, hypothyroidism hyperlipidemia, hyperlipemia, lipid 
myopathies, obesity, lipodystrophies, and phenylketonuria, congenital adrenal hyperplasia, 
pseudovitamin D-deficiency rickets, cerebrotendinous xanthomatosis, and coumarin resistance; a 
30 reproductive disorder, such as disorders of prolactin production, infertility, including tubal disease, 
ovulatory defects, and endometriosis, disruptions of the estrous cycle, disruptions of the menstrual 
cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, endometrial and ovarian 
tumors, uterine fibroids, autoimmune disorders, ectopic pregnancies, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea, disruptions of spermatogenesis, abnormal sperm 
35 physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, 
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Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; a neurological 
disorder, such as epilepsy, ischemic cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's 
disease, Pick's disease, Huntington's disease, dementia, Parkinson's disease and other extrapyramidal 
disorders, amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural 
5 muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases of the 
10 nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
15 endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder; an autoimmune/inflammatory disorder, such as acquired 
immunodeficiency syndrome (AJDS), Addison's disease, adult respiratory distress syndrome, 
20 allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, autoimmune 

hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy-candidiasis-ectodermal 
dystrophy (APECED), bronchitis, cholecystitis, contact dermatitis, Crohn's disease, atopic dermatitis, 
dermatomyositis, diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, 
erythroblastosis fetalis, erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's 
25 syndrome, gout, Graves' disease, Hashimoto's thyroiditis, hypereosinophilia, irritable bowel 
syndrome, multiple sclerosis, myasthenia gravis, myocardial or pericardial inflammation, 
osteoarthritis, osteoporosis, pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid 
arthritis, scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, 
systemic sclerosis, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
30 complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, and trauma; and a viral disorder, such as viral 
infections, e.g., those caused by adenoviruses (acute respiratory disease, pneumonia), arenaviruses 
(lymphocytic choriomeningitis), bunyaviruses (Hantavirus), coronaviruses (pneumonia, chronic 
bronchitis), hepadnaviruses (hepatitis), herpesviruses (herpes simplex virus, varicella-zoster virus, 
Epstcin-Barr virus, cytomegalovirus), flaviviruses (yellow fever), orthomyxoviruses (influenza), 
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papillomaviruses (cancer), paramyxoviruses (measles, mumps), picornoviruses (rhinovirus, 
poliovinis, coxsackie-virus), polyomaviruses (BK virus, JC vims), poxviruses (smallpox), reovirus 
(Colorado tick fever), retroviruses (human immunodeficiency virus, human T lymphotropic vims), 
rhabdoviruses (rabies), rotaviruses (gastroenteritis), and togaviruses (encephalitis, rubella). The 
5 polynucleotide sequences encoding ORP may be used in Southern or northern analysis, dot blot, or 
other membrane-based technologies; in PCR technologies; in dipstick, pin, and multiformat ELISA-Iike 
assays; and in microarrays utilizing fluids or tissues from patients to detect altered ORP expression. 
Such qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding ORP may be usefiil in assays that 
10 detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding ORP may be labeled by standard methods and added to a fluid or tissue sample 

from a patient under conditions suitable for the formation of hybridization complexes. After a suitable 

— 

incubation period, the sample is washed and the signal is quantified and compared with a standard 
value. If the amount of signal in the patient sample is significantly altered in comparison to a control 

15 sample then the presence of altered levels of nucleotide sequences encoding ORP in the sample indicates 
the presence of the associated disorder. Such assays may also be used to evaluate the efficacy of a 
particular therapeutic treatment regimen in animal studies, in clinical trials, or to monitor the treatment 
of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of ORP, a 

20 normal or standard profile for expression is established. This may be accomplished by combining body 
fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding ORP, undo- conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects with values 
from an experiment in which a known amount of a substantially purified polynucleotide is used. 

25 Standard values obtained in this manner may be compared with values obtained from samples from 
patients who are symptomatic for a disorder. Deviation from standard values is used to establish the 
presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in the 

30 patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the development 
35 of the disease, or may provide a means for detecting the disease prior to the appearance of actual 
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clinical symptoms. A more definitive diagnosis of this type may allow health professionals to employ 
preventative measures or aggressive treatment earlier thereby preventing the development or further 
progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding ORP may 
5 involve the use of PCR. These oligomers may be chemically synthesized, generated enzymatically, or 
produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide encoding ORP, or 
a fragment of a polynucleotide complementary to the polynucleotide encoding ORP, and will be 
employed under optimized conditions for identification of a specific gene or conditioa Oligomers may 
also be employed under less stringent conditions for detection or quantification of closely related DNA 
10 or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding ORP may be used to detect single nucleotide polymorphisms (SNPs). SNPs are substitution;, 
insertions and deletions that are a frequent cause of inherited or acquired genetic disease in humans. 
Methods of SNP detection include, but arc not limited to, single-stranded conformation polymorphism 
15 (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers derived from the 
polynucleotide sequences encoding ORP are used to amplify DNA using the polymerase chain reaction 
(PCR). The DNA may be derived, for example, from diseased or normal tissue, biopsy samples, bodily 
fluids, and the like. SNPs in the DNA cause differences in the secondary and tertiary structures of PCR 
products in single-stranded form, and these differences are detectable using geJ electrophoresis in non- 
20 denaturing gels. In fSCCP, the oligonucleotide primers are fluorescently labeled, which allows 
detection of the amplimers in high-throughput equipment such as DNA sequencing machines. 
Additionally, sequence database analysis methods, termed in silico SNP (isSNP), are capable of 
identifying polymorphisms by comparing the sequence of individual overlapping DNA fragments which 
assemble into a common consensus sequence. These computer-based methods filter out sequence 
25 variations due to laboratory preparation of DNA and sequencing errors using statistical models and 
automated analyses of DNA sequence chromatograms. In the alternative, SNPs may be detected and 
characterized by mass spectrometry using, for example, the high throughput MASSARR AY system 
(Sequenom, Inc., San Diego CA). 

Methods which may also be used to quantify the expression of ORP include radiolabeftng or 
30 biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 

standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C. et 
al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 
interest is presented in various dilutions and a specfrophotometric or colorimetric response gives rapid 
35 quantitation. 
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In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 
numbers of genes simultaneously as described in Seilhamer, JJ. et ah, "Comparative Gene Transcript 
5 Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to monitor 
progression/regression of disease as a function of gene expression, and to develop and monitor the 
activities of therapeutic agents in the treatment of disease. In particular, this information may be used 
1 0 to develop a pharmacogenomic profile of a patient in order to select the most appropriate and effective 
treatment regimen for that patient. For example, therapeutic agents which are highly effective and 
display the fewest side effects may be selected for a patient based on his/her pharmacogenomic profile. 

In another embodiment, antibodies specific for ORP, or ORP or fragments thereof may be used 
as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
15 interactions, drug-target interactions, and gene expression profiles, as described above. 

A particular embodiment relates to the use of the polynucleotides of the present invention to 
generate a transcript image of a tissue cm- cell type. A transcript image represents the global pattern of 
gene expression by a particular tissue or cell type. Global gene expression patterns are analyzed by 
quantifying the number of expressed genes and their relative abundance under given conditions and at a 
20 given time. (See Seilhamer et al., "Comparative Gene Transcript Analysis," U.S. Patent Number 
5,840,484, expressly incorporated by reference hereia) Thus a transcript image may be generated by 
hybridizing the polynucleotides of the present invention or their complements to the totality of 
transcripts or reverse transcripts of a particular tissue or cell type. In one embodiment, the 
hybridization takes place in high-throughput format, wherein the polynucleotides of the present 
25 invention or their complements comprise a subset of a plurality of elements on a microarray. The 
resultant transcript image would provide a profile of gene activity. 

Transcript images may be generated using transcripts isolated from tissues, cell lines, biopsies, 
or other biological samples. The transcript image may thus reflect gene expression in vivo, as in the 
case of a tissue or biopsy sample, or in vitro , as in the case of a cell line. 
30 Transcript images which profile the expression of the polynucleotides of the present invention 

may also be used in conjunction with in vitro model systems and preclinical evaluation of 
pharmaceuticals, as well as toxicological testing of industrial and naturally-occurring environmental 
compounds. All compounds induce characteristic gene expression patterns, frequently termed 
molecular fingerprints or toxicant signatures, which are indicative of mechanisms of action and toxicity 
35 (Nuwaysir, E.F. et al. (1999) Mol. Carcinog. 24:153-159; Steiner, S. and N.L. Anderson (2000) 
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Toxicol. Lett 1 12-1 13:467-471, expressly incorporated by reference herein). If a test compound has a 
signature similar to that of a compound with known toxicity, it is likely to share those toxic properties. 
These fingerprints or signatures are most useful and refined when they contain expression information 
from a large number of genes and gene families. Ideally, a genome-wide measurement of expression 
5 provides the highest quality signature. Even genes whose expression is not altered by any tested 
compounds are important as well, as the levels of expression of these genes are used to normalize the 
rest of the expression data. The normalization procedure is useful for comparison of expression data 
after treatment with different compounds. While the assignment of gene function to elements of a 
toxicant signature aids in interpretation of toxicity mechanisms, knowledge of gene function is not 
10 necessary for the statistical matching of signatures which leads to prediction of toxicity. (See, for 
example, Press Release 00-02 from the National Institute of Environmental Health Sciences, released 
February 29, 2000, available at http^/www. niehs.nih.gov/oc/news/toxchip.htm.) Therefore, it is 
important and desirable in toxicological screening using toxicant signatures to include all expressed 
gene sequences. 

15 In one embodiment, the toxicity of a test compound is assessed by treating a biological sample 

containing nucleic acids with the test compound. Nucleic acids that are expressed in the treated 
biological sample are hybridized with one or more probes specific to the polynucleotides of the 
present invention, so that transcript levels corresponding to the polynucleotides of the present 
invention may be quantified. The transcript levels in the treated biological sample are compared with 

20 levels in an untreated biological sample. Differences in the transcript levels between the two samples 
are indicative of a toxic response caused by the test compound in the treated sample. 

Another particular embodiment relates to the use of the polypeptide sequences of the present 
invention to analyze the protcome of a tissue or cell type. The term proteome refers to the global 
pattern of protein expression in a particular tissue or cell type. Each protein component of a proteome 

25 can be subjected individually to further analysis. Proteome expression patterns, or profiles, are 
analyzed by quantifying the number of expressed proteins and their relative abundance under given 
conditions and at a given time. A profile of a cell's proteome may thus be generated by separating and 
analyzing the polypeptides of a particular tissue or cell type. In one embodiment, the separation is 
achieved using two-dimensional gel dectrophoresis, in which proteins from a sample are separated by 

30 isoelectric focusing in the first dimension, and thai according to molecular weight by sodium dodecyl 
sulfate slab gel electrophoresis in the second dimension (Steiner and Anderson, supra) . The proteins are 
visualized in the gd as discrete and uniquely positioned spots, typically by staining the gel with an agent 
such as Coomassie Blue or silver or fluorescent stains. The optical density of each protdn spot is 
generally proportional to the levd of the protdn in the sample. The optical densities of equivalent^ 

35 positioned protdn spots from different samples, for example, from biological samples either treated or 
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untreated with a test compound or therapeutic agent, are compared to identify any changes in protein 
spot density related to the treatment The proteins in the spots are partially sequenced using, for 
example, standard methods employing chemical or enzymatic cleavage followed by mass spectrometry. 
The identity of the protein in a spot may be determined by comparing its partial sequence, preferably of 
5 at least 5 contiguous amino acid residues, to the polypeptide sequences of the present invention. In 
some cases, further sequence data may be obtained for definitive protein identification. 

A proteomic profile may also be generated using antibodies specific for ORP to quantify the 
levels of ORP expression. In one embodiment, the antibodies are used as elements on a microarray, and 
protein expression levels are quantified by exposing the microarray to the sample and detecting the 
10 levels of protein bound to each array element (Lueking, A. et al. (1999) Anal. Biochem. 270:103-1 1 1 ; 
Mendoze, L.G. et al. (1999) Biotechniques 27:778-788). Detection may be performed by a variety of 
methods known in the art, for example, by reacting the proteins in the sample with a thiol- or amino- 
reactive fluorescent compound and detecting the amount of fluorescence bound at each array element 

Toxicant signatures at the proteome level are also useful for toxicological screening, and should 

15 be analyzed in parallel with toxicant signatures at the transcript level. There is a poor correlation 
between transcript and protein abundances for some proteins in some tissues (Anderson, N.L. and J. 
Seilhamer (1997) Electrophoresis 18:533-537), so proteome toxicant signatures may be useful in the 
analysis of compounds which do not significantly affect the transcript image, but which alter the 
proteomic profile. In addition, the analysis of transcripts in body fluids is difficult, due to rapid 

20 degradation of mRNA, so proteomic profiling may be more reliable and informative in such cases. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteins that are expressed in the treated biological 
sample arc separated so that the amount of each protein can be quantified. The amount of each protein 
is compared to the amount of the corresponding protean in an untreated biological sample. A difference 

25 in the amount of protein between the two samples is indicative of a toxic response to the test compound 
in the treated sample. Individual proteins are identified by sequencing the amino acid residues of the 
individual proteins and comparing these partial sequences to the polypeptides of the present invention. 

In another embodiment, the toxicity of a test compound is assessed by treating a biological 
sample containing proteins with the test compound. Proteias from the biological sample are incubated 

30 with antibodies specific to the polypeptides of the present invention. The amount of protein recognized 
by the antibodies is quantified. The amount of protein in the treated biological sample is compared with 
the amount in an untreated biological sample. A difference in the amount of protein between the two 
samples is indicative of a toxic response to the test compound in the treated sample. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 

35 Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 
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USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad Sci. USA 94:2150- 
2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are wdl 
known and thoroughly described in DNA Microarravs: A Practical Approach . M. Schena, ed (1 999) 
5 Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding ORP may be used to 
generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
10 of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
15 Genet 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop genetic 
linkage maps, for example, which correlate the inheritance of a disease state with the inheritance of a 
particular chromosome region or restriction fragment length polymorphism (RFLP). (See, e.g., 
Lander, E.S. and D. Botstein (1986) Proc. NaU. Acad. Sci. USA 83:7353-7357.) 
20 Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic map 

data. (See, e.g., Hdnz-Ulrich, et al. (1995) in Meyers, supra , pp. 965-968.) Examples of genetic map 
data can be found in various scientific journals or at the Online Mendelian Inheritance in Man (OM1M) 
World Wide Web site. Correlation between the location of the gene encoding ORP on a physical map 
and a specific disorder, or a predisposition to a specific disorder, may help define the region of DNA 
25 associated with that disorder and thus may further positional cloning efforts. 

Jnsitu hybridization of chromosomal preparations and physical mapping techniques, such as 
linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
30 valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely localized 
by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23„any sequences 
mapping to that area may represent associated or regulatory genes for further investigation. (See, e.g., 
Gatti, R.A. el al. (1988) Nature 336:577-580.) The nucleotide sequence of the instant invention may 
35 also be used to detect differences in the chromosomal location due to translocation, inversion, etc., 
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among normal, carrier, or affected individuals. 

In another embodiment of the invention, ORP, its catalytic or immunogenic fragments, or 
oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
5 solid support, borne on a cell surface, or located intracellular^. The formation of binding complexes 
between ORP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 
having suitable binding affinity to the protein of interest (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
10 synthesized on a solid substrate. The test compounds are reacted with ORP, or fragments thereof, and 
washed Bound ORP is then detected by methods well known in the art. Purified ORP can also be 
coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 
non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
15 antibodies capable of binding ORP specifically compete with a test compound for binding ORP. In this 
manner, antibodies can be used to detect the presence of any peptide which shares one or more antigenic 
determinants with ORP. 

In additional embodiments, the nucleotide sequences which encode ORP may be used in any 
molecular biology techniques that have yet to be developed, provided the new techniques rely on 
20 properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 
description, utilize the present invention to its fullest extent The following preferred specific 
embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
25 of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, in 
particular U.S. Ser. No. 60/1 72,367, are hereby expressly incorporated by reference. 

EXAMPLES 

30 I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some tissues 
were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized and lysed 
in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a monophasic 
solution of phenol and guanidine isothiocyanate. The resulting lysales were centrifuged over CsCI 
35 cushioas or extracted with chloroform. RNA was precipitated from the lysates with either isopropanol 
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or sodium acetate and ethanol, or by other routine methods. 

Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 
purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN, 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g„ the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 
libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra , units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 
oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-sdected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the polylinker of a suitable plasmid, e.g., 
PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2. 1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto C A). Recombinant 
plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or SOLR 
from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
II. Isolation of cDNA Clones 

Plasmids obtained as described in Example I were recovered from host cells by in vivo excision 
using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using at least 
one of the following: a Magic or WIZARD Minipreps DN A purification system (Promega); an AGTC 
Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, Q1AWELL 
8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E. A.L. PREP 96 plasmid 
purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0.1 ml of 
distilled water and stored, with or without lyophilization, at 4 C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 384- 
well plates, and the concentration of amplified plasmid DNA was quantified fluorometricaJly using 
PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence scanner 
(Labsystems Oy, Helsinki, Finland). 
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III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
Sequencing reactions were processed using standard methods or high-throughput instrumentation 
such as the ABI CATALYST 800 (Applied Biosystems) thermal cycler or the PTC-200 thermal cycler 
(MJ Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the 
M1CROLAB 2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared 
using reagents provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as 
the ABI PRISM BIGDYE Terminator cycle sequencing ready reaction kit (Applied Biosystems). 
Electrophoretic separation of cDNA sequencing reactions and detection of labeled polynucleotides were 
carried out using the MEGABACE KMX) DNA sequencing system (Molecular Dynamics); the ABI 
PRISM 373 or 377 sequencing system (Applied Biosystems) in conjunction with standard ABI 
protocols and base calling software; or other sequence analysis systems known in the art. Reading 
frames within the cDNA sequences were identified using standard methods (reviewed in Ausubel, 1997, 
suDEa, unit 7.7). Some of the cDNA sequences were selected for extension using the techniques 
disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable descriptions, 
references, and threshold parameters. The first column of Table 5 shows the tools, programs, and 
algorithms used, the second column provides brief descriptions thereof, the third column presents 
appropriate references, all of which are incorporated by reference herein in their entirety, and the fourth 
column presents, where applicable, the scores, probability values, and other parameters used to evaluate 
the strength of a match between two sequences (the higher the score, the greater the homology between 
two sequences). Sequences were analyzed using M ACDNASIS PRO software (Hitachi Software 
Engineering, South San Francisco CA) and LASERGENE software (DNASTAR). Polynucleotide and 
polypeptide sequence alignments were generated using the default parameters specified by the clustal 
algorithm as incorporated into the MEG ALIGN multisequence alignment program (DNASTAR), which 
also calculates the percent identity between aligned sequences. 

The polynucleotide sequences were validated by removing vector, linker, and polyA sequences 
and by masking ambiguous bases, using algorithms and programs based on BLAST, dynamic 
programing, and dinucleoUde nearest neighbor analysis. The sequences were then queried against a 
selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, and 
eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire annotation 
using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled into ftill 
length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and were screened 
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for open reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 

polynucleotide sequences were translated to derive the corresponding full length amino acid sequences, 

and these full length sequences were subsequently analyzed by querying against databases such as the 

GenBank databases (described above), SwissProt, BLOCKS, PRINTS, DOMO, PRODOM, Prosite, 
5 and Hidden Markov Model (HMM)- based protein family databases such as PFAM. HMM is a 

probabilistic approach which analyzes consensus primary structures of gene families. (See, e.g., 

Eddy, S.R. (1996) Curr. Opin. Struct Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
10 NO:28-54. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 

amplification technologies were described in The Invention section above. 

IV* Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a gene 

and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs from a 
15 particular cell type or tissue have been bound. (See, e.g., Sambrook, supra , ch. 7; Ausubel, 1995, 

supra , ch. 4 and 16 ) 

Analogous computer techniques applying BLAST were used to search for identical or related 
molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 
much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
20 search can be modified to determine whether any particular match is categorized as exact or similar. 
The basis of the search is the product score, which is defined as: 

BLAST Score x Percent Identity 
5 x minimum {length(Seq. 1), length(Seq. 2)} 

25 The product score takes into account both the degree of similarity between two sequences and the length 
of the sequence match. The product score is a normalized value between 0 and 100, and is calculated 
as follows: the BLAST score is multiplied by the percent nucleotide identity and the product is divided 
by (5 times the length of the shorter of the two sequences). The BLAST score is calculated by 
assigning a score of +5 for every base that matches in a high-scoring segment pair (HSP), and -4 for 

30 every mismatch. Two sequences may share more than one HSP (separated by gaps). If there is more 
than one HSP, then the pair with the highest BLAST score is used to calculate the product score. The 
product score represents a balance between fractional overlap and quality in a BLAST alignment For 
example, a product score of 100 is produced only for 100% identity over the entire length of the shorter 
of the two sequences being compared. A product score of 70 is produced either by 1 00% identity and 

35 70% overlap at one end, or by 88% identity and 1 00% overlap at the other. A product score of 50 is 
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produced either by 100% identity and 50% overlap at one end, or 79% identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding ORP occurred. Analysis involved the categorization of cDNA libraries by 
organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
5 developmental, endocrine, gastrointestinal, hematopoictic/immune, musculoskeletal, nervous, 

reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in Table 
10 3. 

V. Chromosomal Mapping of ORP Encoding Polynucleotides 

The cDN A sequences which were used to assemble SEQ ID NO:28-54 were compared with 
sequences from the Incyte LIFESEQ database and public domain databases using BLAST and other 
implementations of the Smith- Waterman algorithm. Sequences from these databases that matched 
15 SEQ ID NO:28-54 were assembled into clusters of contiguous and overlapping sequences using 
assembly algorithms such as Phrap (Table 5). Radiation hybrid and genetic mapping data available 
from public resources such as the Stanford Human Genome Center (SHGC), Whitehead Institute for 
Genome Research (WIGR), and G6n6thon were used to determine if any of the clustered sequences 
had been previously mapped. Inclusion of a mapped sequence in a cluster resulted in the assignment 
of all sequences of that cluster, including its particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:31, SEQ ID NO:42, SEQ ID NO:48, and SEQ ID 
NO:53 are described in The Invention as ranges, or intervals, of human chromosomes. More than one 
map location is reported for SEQ ID NO:48, indicating that previously mapped sequences having 
similarity, but not complete identity, to SEQ ID NO:48 were assembled into their respective clusters. 
The map position of an interval, in centiMorgans, is measured relative to the terminus of the 
chromosome's p-arm. (The centiMorgan (cM) is a unit of measurement based on recombination 
frequencies between chromosomal markers. On average, 1 cM is roughly equivalent to 1 megabase 
(Mb) of DNA in humans, although this can vary widely due to hot and cold spots of recombination.) 
The cM distances are based on genetic markers mapped by G6n6thon which provide boundaries for 
radiation hybrid markers whose sequences were included in each of the clusters. Human genome 
maps and other resources available to the public, such as the NCBI "GeneMap'99 n World Wide Web 
site (http://www.ncbi.nlm.mh.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 

VI. Extension of ORP Encoding Polynucleotides 
The full length nucleic acid sequences of SEQ ID NO:28-54 were produced by exleasion of an 
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appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3* extension of the known fragment. The initial primers were designed using OLIGO 
4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 nucleotides 
in length, to have a GC content of about 50% or more, and to anneal to the target sequence at 
temperatures of about 68 C to about 72 C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizatioas was avoided. 

Selected human cDN A libraries were used to extend the sequence. If more than one extension 
was necessary or desired, additional or nested sets of primers were designed 

High fidelity amplification was obtained by PCR using methods well known in the art. PGR 
was performed in 96-wcll plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2+ , (NH 4 ) 2 S0 4 , 
and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pftj DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1 : 94 C, 3 min; Step 2: 94 C, 15 sec; Step 3: 60 C, 1 min; Step 4: 68 C, 
2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 C, 5 min; Step 7: storage at 4 C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94 C, 3 min; Step 2: 
94 C, 15 sec; Step 3: 57 C, 1 min; Step 4: 68 C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68 C, 5 min; Step 7: storage at 4 C. 

The concentration of DNA in each well was determined by dispensing 100 \x\ PICOGREEN 
quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 pi of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 ltolO 1 aliquot of the reaction mixture was analyzed by electrophoresis 
on a 1 % agarose mini-gel to determine which reactions were successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison Wl), and 
sonicated or sheared prior to rdigation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) agarose 
gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones were 
religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site overhangs, 
and transfectcd into competent E. coli cells. Transformed cells were selected on antibiotic-containing 
media, and individual colonies were picked and cultured overnight at 37 C in 384-wc31 plates in LB/2x 
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carb liquid media. 

The ceils were lysed, and DNA was amplified by PCR using Taq DNA polymerase (Amersham 
Pharmacia Biotech) and Pfu DNA polymerase (Stratagcne) with the following parameters: Step 1 : 
94 C, 3 min; Step 2: 94 C, 15 sec; Step 3: 60 C, 1 min; Step 4: 72 C, 2 min; Step 5: steps 2, 3, and 4 
5 repeated 29 times; Step 6: 72 C, 5 min; Step 7: storage at 4 C. DNA was quantified by PICOGREEN 
reagent (Molecular Probes) as described above. Samples with low DNA recoveries were reamplified 
using the same conditions as described above. Samples were diluted with 20% dimethysulfoxide (1 :2, 
v/v), and sequenced using DYENAMIC energy transfer sequencing primers and the DYEN AMIC 
DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM BIGDYE Terminator cycle 
10 sequencing ready reaction kit (Applied Biosystems). 

In like manner, the polynucleotide sequences of SEQ ID NO:28-54 are used to obtain 5' 
regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VII. Labeling and Use of Individual Hybridization Probes 

1 5 Hybridization probes deri ved from SEQ ID NO:28-54 are employed to screen cDNAs, genomic 

DNAs, or mRN As. Although the labeling of oligonucleotides, consisting of about 20 base pairs, is 
specifically described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 software (National 
Biosciences) and labeled by combining 50 pmol of each oligomer, 250 Ci of [y- 32 P] adenosine 

20 triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase (DuPont NEN, Boston 
MA). The labeled oligonucleotides are substantially purified using a SEPH ADEX G-25 superfine size 
exclusion dcxtran bead column (Amersham Pharmacia Biotech). An aliquot containing 10 7 counts per 
minute of the labeled probe is used in a typical membrane-based hybridization analysis of human 
genomic DNA digested with one of the following endonucleases: Ase I, Bgl IT, Eco RI, Pst I, Xba I, or 

25 Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (Nytran Plus, Schleicher & Schueil, Durham NH). Hybridization is carried out for 16 
hours at 40 C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 

30 Hybridization patterns are visualized using autoradiography or an alternative imaging means and 
compared. 

VIIL Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, Sec, e.g., Baldeschweiler, supra) , mechanical 
35 microspolting technologies, and derivatives thereof. The substrate in each of the aforementioned 



64 



WO 01/44448 PCT/USOO/33158 

technologies should be uniform and solid with a non-porous surface (Schena (1999), supra) . Suggested 
substrates include silicon, silica, glass slides, glass chips, and silicon wafers. Alternatively, a procedure 
analogous to a dot or slot blot may also be used to arrange and link dements to the surface of a 
substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical array may be 
produced using available methods and machines well known to those of ordinary skill in the art and may 
contain any appropriate number of dements. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; 
Shalon, D. et al. (1996) Genome Res. 6:639-645; Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 
16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof may 
comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can be 
selected using software well known in the art such as LASERGENE software (DNASTAR). The array 
elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 
complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described in 
detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 
poly(A)* RNA is purified using the oligo-(dT) cellulose method. Each poly(A)* RNA sample is 
reverse transcribed using MMLV rcveree-transcriptase, 0.05 pg/^l oligo-(dT) prima- (21mer), IX first 
strand buffer, 0.03 units/pl RNase inhibitor, 500 uM dATP, 500 \M dGTP, 500 pM dTTP, 40 pM 
dCTP, 40 pM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poly(A) + RNA with 
GEMBRIGHT kits (lncyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 
(CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 pi 5X SSC/0.2% SDS. 
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Microarray Preparation 

Sequences of the present invention are used to generate array elements. Each array element is 
amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification uses 
primers complementary to the vector sequences flanking the cDNA insert Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
Mg. Amplified amy elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 
slides (Corning) are cleaned by ultrasound in 0. 1 % SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 
oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 pi of the array element DNA, at an average 
concentration of 100 ng/pl, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

Microarrays are UV-crosslinked using a STRATALINKER UV-crosslinker (Stratagene). 
Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 C followed by washes in 
0.2% SDS and distilled water as before. 
Hybridization 

Hybridization reactions contain 9 pi of sample mixture consisting of 0.2 pg each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65 C for 5 minutes and is ali quoted onto the microarray surface and covered with 
an 1.8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 pi of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60 C. The arrays are washed for 10 min at 45 C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 C in a second wash buffer (0.1X SSC), and dried. 
Detection 

Reporter-labeled hybridization complexes are detected with a microscope equipped with an 
Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
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focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 
scanned past the objective. Hie 1 .8 cm x 1 .8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 

In two separate scans, a mixed gas multiline laser excites the two fluoropbores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
the array contains a complementary DNA sequence, allowing the intensity of the signal at that 
location to be correlated with a weight ratio of hybridizing species of 1 : 100,000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-coior transformation to a pseudocolor scale ranging from blue Qow signal) to red (high 
signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore's emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
IX. Complementary Polynucleotides 

Sequences complementary to the ORP-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring ORP. Although use of oligonucleotides 
comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
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4.06 software (National Biosciences) and the coding sequence of ORP. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5* sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
designed to prevent ribosomal binding to the ORP-encoding transcript. 
X, Expression of ORP 

Expression and purification of ORP is achieved using bacterial or virus-based expression 
systems. For expression of ORP in bacteria, cDNA is subcloned into an appropriate vector containing 
an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA transcription. 
Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid promoter and the 
T5 orT7 bacteriophage promoter in conjunction with the lac operator regulatory element. 
Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). Antibiotic 
resistant bacteria express ORP upon induction with isopropyl beta-D-thiogalactopyranosidc (ffTG). 
Expression of ORP in eukaryotic cells is achieved by infecting insect or mammalian cell lines with 
recombinant Autographica californica nuclear polyhedrosis virus (AcMNPV), commonly known as 
baculovirus. The nonessential polyhcdrin gene of baculovirus is replaced with cDNA encoding ORP by 
either homologous recombination or bacterial-mediated transposition involving transfer plasmid 
intermediates. Viral infectivity is maintained and the strong polyhedrin promoter drives high levels of 
cDNA transcription. Recombinant baculovirus is used to infect Snodontera frugioerda (Sf9) insect 
cells in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 
genetic modiflcations to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad Sci. USA 
91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945.) 

In most expression systems, ORP is synthesized as a fusion protein with, e.g., glutathione S- 
transferasc (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 
affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26-kilodalton 
enzyme from Schistosom a iaponicum. enables the purification of fusion proteins on immobilized 
glutathione under conditions that maintain protein activity and antigenicity (Amersham Pharmacia 
Biotech). Following purification, the GST moiety can be proteolyticafly cleaved from ORP at 
specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffinity purification 
using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman Kodak). 6- 
His, a stretch of six consecutive histidine residues, enables purification on metal-chdate resins 
(QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, supra , 
ch. 10 and 1 6). Purified ORP obtained by these methods can be used directly in the assays shown in 
Examples XI and XV. 
XL Demonstration of ORP Activity 

For purposes of example, an assay demonstrating the activity of a short-chain alcohol 
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dehydrogenase is described. Essentially the same method is used for other types of oxidoreductases, 
with suitable substitution of the substrate and electron acceptor. ORP activity is demonstrated by the 
oxidation of NADPH to NADP in the presence of substrate (Kunau and Dommes (1978) Eur. J. 
Biochem. 91 :533-544). Substrates include, but are not limited to, all-trans-retinaldehyde and cis-4- 
5 dienoyl-CoA. ORP is preincubated for 10 minutes at 37 C in 60 pM potassium phosphate (pH 7.4), 
125 nM NADPH, and 0.2 pM CoA (coenzyme A). The reaction is initiated by addition of the 
appropriate substrate (12.5 to 150 pM final concentration). The change in absorbance of the reaction 
at 340 nm, due to the oxidation of NADPH to NADP, is measured using a spectrophotometer at 23 
°C. Units of ORP activity are expressed as pmoles of NADP formed per minute. A reaction lacking 
10 ORP is used as a negative control. 

Alternatively, ORP activity is assayed by measuring the reduction of insulin. Aliquots of 
ORP arc preincubated at 37 °C for 20 min with 2 pi of 50 mM Hepes, pH 7.6, 100 pg/ml bovine 
serum albumin, and 2 mM DTT in a total volume of 70 pi. Then, 40 pi of a reaction mixture 
composed of 200 pi of Hepes (1 M), pH 7.6, 40 pi of EDTA (0.2 M), 40 pi of NADPH (40 mg/ml), 

15 and 500 pi of insulin (10 mg/ml) is added. The reaction is initiated with the addition of 10 pi of 

thioredoxin reductase from calf thymus (3.0 A41 2 unit), and incubation is continued for 20 min at 37 
°C. The reaction rate is followed by monitoring the oxidation of NADPH at 412 nM. The oxidation 
of NADPH is proportional to the amount of insulin reduction. 
XII. Functional Assays 

20 ORP function is assessed by expressing the sequences encoding ORP at physiologically 

elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
include pCMV SPORT plasmid (Life Technologies) and pCR3.1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 g of recombinant vector are transiently transfected into a 

25 human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 

formulations or electroporation. 1-2 g of an additional plasmid containing sequences encoding a 
marker protein are co- transfected. Expression of a marker protein provides a means to distinguish 
transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP, 

30 Clontech), CD64, or a CD64-GFP fusion protda Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate the 
apoptotic slate of the cells and other cellular properties. FCM detects and quantifies the uptake of 
fluorescent molecules that diagnose events preceding or coincident with cell death. These events include 
changes in nuclear DNA content as measured by staining of DNA with propidium iodide; changes in 

35 cell size and granularity as measured by forward light scatter and 90 degree side light scatter; down- 
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regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; alterations in 
expression of cdl surface and intracellular proteins as measured by reactivity with specific antibodies; 
and alterations in plasma membrane composition as measured by the binding of fluorescein-conjugated 
Annexin V protein to the cell surface. Methods in flow cytometry are discussed in Ormerod, M.G. 
5 ( 1 994) Flow Cytometry , Oxford, New York NY. 

The influence of ORP on gene expression can be assessed using highly purified populations of 
cells transfected with sequences encoding ORP and either CD64 or CD64-GFP. CD64 and CD64-GFP 
are expressed on the surface of transfected cells and bind to conserved regions of human 
immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
10 magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success NY). 
mRN A can be purified from the cells using methods well known by those of skill in the art. Expression 
of mRN A encoding ORP and other genes of interest can be analyzed by northern analysis or microarray 
techniques. 

XIII. Production of ORP Specific Antibodies 

*5 ORP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, c.g., 

Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the ORP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 

20 synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-lerminus or in hydrophilic regioas are well 
described in the art (See, e.g., Ausubd, 1995, sunra . ch. 1 1 .) 

Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431 A 
peptide synthesizer (Applied Biosystems) using FMOC chemistry and coupled to KLH (Sigma- Aldrich, 

25 St. Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra .) Rabbits are immunized with the oligopeptide-KLH 
complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti-ORP 
activity by, for example, binding the peptide or ORP to a substrate, blocking with 1 % BS A, reacting 
with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 

30 XIV. Purification of Naturally Occurring ORP Using Specific Antibodies 

Naturally occurring or recombinant ORP is substantially purified by immunoaffinity 
chromatography using antibodies specific for ORP. An immunoaffinity column is constructed by 
covalently coupling anti-ORP antibody to an activated chromatographic resin, such as CNBr-activated 
SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is blocked and washed 

35 according to the manufacturer's instructions. 
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Media containing ORP are passed over the immunoaflinity column, and the column is washed 
under conditions that allow the preferential absorbance of ORP (e.g., high ionic strength buffers in the 
presence of detergent). The column is eluted under conditions that disrupt antibody/ORP binding (e.g., 
a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, such as urea or thiocyanate ion), and 
ORP is collected 

XV. Identification of Molecules Which 1 nterac t with ORP 

ORP, or biologically active fragments thereof, are labeled with I25 I Bolton-Hunter reagent. 
(See, e.g„ Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 
previously arrayed in the wells of a multi-well plate are incubated with the labeled ORP, washed, and 
any wells with labeled ORP complex are assayed. Data obtained using different concentrations of ORP 
are used to calculate values for the number, affinity, and association of ORP with the candidate 
molecules. 

Alternatively, molecules interacting with ORP are analyzed using the yeast two-hybrid 
system as described in Fields, S. and O. Song (1989, Nature 340:245-246), or using commercially 
available kits based on the two-hybrid system, such as the MATCHMAKER system (Clontech). 

ORP may also be used in the PATHCALL1NG process (CuraGen Corp., New Haven CT) 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interactions 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
No. 6,057,101). 

Various modificatioas and variations of the described methods and systems of the invention will 
be apparent to those skilled in the art without departing from the scope and spirit of the invention. 
Although the invention has been described in connection with certain embodiments, it should be 
understood that the invention as claimed should not be unduly limited to such specific embodiments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvious 
to those skilled in molecular biology or related fields are intended to be within the scope of the following 
claims. 
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Vector 1 


pINCY | 


PSPORTl 


pINCY 


pINCY 


fit 

H 

a 


PSPORTl 


pINCY 


fit 

H 
Q* 


Disease or Condition 
(Fraction of Total) 


Cancer (0.524) 
Inflammation ( 0 . 262 ) 


Trauma (1.000) 


Cancer (0.420) 

Inflammation (0 . 280) 

Cell proliferation (0.200) 


Trauma (0.429) 
Cell proliferation (0.286) 
Inflammation (0 . 143 ) 
Neurological (0.143) 


Cancer (0.538) 

Cell proliferation (0.192) 

Inflammation (0 . 192) 


Cancer (0.385) 
Cell proliferation (0.154) 
! Inflammation (0.154) 
Trauma (0.154) 


Cancer (0.463) 

Inflammation (0.232) 

Cell proliferation (0.168) 


Cancer (0.333) 
Inflammation (0 . 333 ) 
Neurological (0. 167) 
Trauma (0.167) 


Tissue Expression 
(Fraction of Total) 


Gastrointestinal (0 . 405) 
Reproductive (0.262) 
Urologic (0.119) 


Nervous (1.000) 


Reproductive (0.300) 
Gastrointestinal (0 . 160) 
Nervous (0.120) 


Cardiovascular (0.143) 
Developmental ( 0 . 143 ) 
Endocrine (0.143) 
Gastrointestinal ( 0 . 143 ) 
Reproductive ( 0 . 143 ) 
Urologic (0.143) 
Nervous (0.143) 


Reproductive (0.308) 
Nervous (0.154) 
Cardiovascular (0.115) 
Hematopoietic/Immune (0 . 115) 
Gastrointestinal (0. 115) 


I Nervous (0.538) 
Hematopoietic/Immune (0.154) 
Reproductive (0.154) 


Reproduc t ive (0.316) 
Gastrointestinal (0 . 137) 
Cardiovascular (0 . 137) 


Nervous (0.500) 
Endocrine (0.167) 
Musculoskeletal (0.167) 
Gastrointestinal (0 . 167) 


Selected 
Fragments 




1-79 
173-215 
440-549 




1-304 
572-1196 

-j 


448-765 
1039-1926 


490-849 
1072-1152 


00 

ro 
J 

tH 


243-539 
555-1352 


Nucleotide 
SEQ ID NO: 


ro 


00 

ro 


ro 
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tH 
^» 




ro 
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Vector | 


pINCY 


5h 
w 

a 


pINCY 


PBLUESC 
RIPT | 


pINCY I 


>1 

H 

a 


pINCY 


pINCY 


pINCY 


pINCY J 


Disease or Condition 
(Fraction of Total) 


Cancer (0 .750) 
Inflammation (0 . 250) 


Cancer (0.400) 
Inflammation (0. 257) 
Trauma (0.171) 


Cancer (0.474) 

Inflammation (0.368) 

Cell proliferation (0.105) 


Cancer (0.471) 

Inflammation (0 . 235) • 
Trauma (0.206) 


Cancer (0.389) 

Inflammation (0 . 250) 

Cell proliferation (0.222) 


Cancer (0.353) 

Inflammation (0 . 275) 

Cell proliferation (0.255) 


Cancer (0.442) 

Inflammation (0 . 273) 

Cell proliferation (0.156) 


Cancer (0.447) 

Inflammation (0.191) 

Cell proliferation (0.138) 


Cancer (0.510) 

Cell proliferation (0.204) 

Inflammation (0 . 204) 


Cancer (0.639) 
Inflammation (0 . 213 ) 


Tissue Expression 
(Fraction of Total) 


Gastrointestinal (0.500) 
Reproductive (0.250) 
Nervous (0.250) 


Reproductive (0.229) 
Nervous (0.171) 
Cardiovascular (0.143) 


Nervous (0.211) 
Reproductive (0 . 211) 
Gastrointestinal (0.158) 
Hematopoietic /Immune (0 . 158) 


Gastrointestinal (0 . 412) 
Reproductive (0 . 206) 
Nervous (0.118) 


Reproductive (0.361) 
Nervous (0.194) 
Developmental (0.111) 


Reproductive (0.235) 
Nervous (0.216) 
Hematopoietic/Immune (0 . 157) 


Reproductive (0.247) 
Nervous (0 . 195) 
Gastrointestinal (0.143) 


Repr oduc t i ve (0.255) 
Nervous (0.181) 
Gastrointestinal (0 . 138) 


Reproductive (0 . 327) 
Nervous (0.143) 
Gastrointestinal (0.102) 


Reproductive (0 . 328) 
Nervous (0.197) 
Cardiovascular (0. 180) 


Selected 
Fragments 


127-171 


1-80 
300-338 
792-833 


465-527 
612-692 
810-968 


655-1305 


1-69 
1397-1459 


1-51 

157-198 
1246-2101 


1005-1073 
1233-1328 


CN 
00 

1 

rH 


1-94 


1-469 
998-1624 


Nucleotide 

SEQ ID NO: 


in 


VD 
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o 
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CN 

in 
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in 
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1. An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-27, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l-27, and 

d) an immunogenic fragment of an amino acid sequence selected from the group consisting 
of SEQ ID NO: 1-27. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l- 

27. 

3. An isolated polynucleotide encoding a polypeptide of claim 1 . 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:28-54. 

6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim 1, the method comprising: 

a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 

b) recovering the polypeptide so expressed. 
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10. An isolated antibody which specifically binds to a polypeptide of claim 1. 

1 1. An isolated polynucleotide comprising a polynucleotide sequence selected from the 
group consisting of: 

5 a) a polynucleotide sequence selected from the group consisting of SEQ ID NO: 28-54, 

b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to s 
polynucleotide sequence selected from the group consisting of SEQ ID NO:28-54, 

c) a polynucleotide sequence complementary to a), 

d) a polynucleotide sequence complementary to b), and 
10 e) an RNA equivalent of a)-d). 

1 2. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1 . 



1 3. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 

b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof 



14. A method of claim 1 3, wherein the probe comprises at least 60 contiguous nucleotides. 

15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1 , the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 

b) detecting the presence or absence of said amplified target polynucleotide or fragment 
thereof, and, optionally, if present, the amount thereof 

16. A composition comprising an effective amount of a polypeptide of claim 1 and a 
pharmaceutical^ acceptable excipicnt 
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1 7. A composition of claim 1 6, wherein the polypeptide comprises an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 -27. 



1 8. A method for treating a disease or condition associated with decreased expression of 

5 functional ORP, comprising administering to a patient in need of such treatment the composition of 
claim 16. 

1 9. A method for screening a compound for effectiveness as an agonist of a polypeptide of 
claim 1 , the method comprising: 

10 a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 

20. A composition comprising an agonist compound identified by a method of claim 19 and 
a pharmaceutical^ acceptable excipient 

15 

21. A method for treating a disease or condition associated with decreased expression of 
functional ORP, comprising administering to a patient in need of such treatment a composition of 
claim 20. 



20 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1 , the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

25 23. A composition comprising an antagonist compound identified by a method of claim 22 

and a pharmaceutical^ acceptable excipient 

24. A method for treating a disease or condition associated with overexpression of functional 
ORP, comprising administering to a patient in need of such treatment a composition of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1 , said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 
conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
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compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
claim 1 , said method comprising: 

5 a) combining the polypeptide of claim 1 with at least one test compound under conditions 

permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 
10 with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 

in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

27. A method for screening a compound for effectiveness in altering expression of a target 
15 polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 

comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, under conditions 
suitable for the expression of the target polynucleotide, 

b) detecting altered expression of the target polynucleotide, and 

20 c) comparing the expression of the target polynucleotide in the presence of varying amounts of 

the compound and in the absence of the compound. 

28. A method for assessing toxicity of a test compound, said method comprising: 
a) treating a biological sample containing nucleic acids with the test compound; 

25 b) hybridizing the nucleic acids of the treated biological sample with a probe comprising at 

least 20 contiguous nucleotides of a polynucleotide of claim 1 1 under conditions whereby a specific 
hybridization complex is formed between said probe and a target polynucleotide in the biological 
sample, said target polynucleotide comprising a polynucleotide sequence of a polynucleotide of claim 
1 1 or fragment thereof; 

30 c) quantifying the amount of hybridization complex; and 

d) comparing the amount of hybridization complex in the treated biological sample with the 
amount of hybridization complex in an untreated biological sample, wherein a difference in the 
amount of hybridization complex in the treated biological sample is indicative of toxicity of the test 
compound. 
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Thr Gly Tyr 


Glu Thr Glu Ara Gin Ara His 

* *>***» w * v* 4 **» y Will y il*kj 












410 


415 


^^u 


Thr 


Ala 


Leu 


Leu 


Ala Ala Thr 


Asd Leu Leu Lvs Ara lieu Tvr 












425 


430 


die; 
ft j j 


Thr 


Ser 


Ala 


Ser 


Pro Leu Val 


Leu Leu Ara T*hr Tm f2 1 vr T.ou 


pin 










440 


445 


>i en 
ft -> U 


Ala 


Thr 


Asn 


Ala 


Val Ser Pro 


Leu Lvs Glu Gin He Met Ala 


Phe 










455 


460 


465 


Ala 


Ser 


Lys 











<210> 2 

<211> 254 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 907607CD1 

<400> 2 



Met 


Ala 


Ala 


Thr 


Ala Arg 


Arg 


Gly Trp Gly Ala Ala Ala Val 


Ala 


1 








5 




10 


15 


Ala 


Gly 


Leu 


Arg 


Arg Arg 


Phe 


Cys His Met Leu Lys Asn Pro 


Tyr 










20 




25 


30 


Thr 


He 


Lys 


Lys 


Gin Pro 


Leu 


His Gin Phe Val Gin Arg Pro 


Leu 










35 




40 


45 


Phe 


Pro 


Leu 


Pro 


Ala Ala 


Phe 


Tyr His Pro Val Arg Tyr Met 


Phe 










50 




55 


60 


lie 


Gin 


Thr 


Gin 


Asp Thr 


Pro 


Asn Pro Asn Ser Leu Lys Phe 


He 










65 




70 


75 


Pro 


Gly 


Lys 


Pro 


Val Leu 


Glu 


Thr Arg Thr Met Asp Phe Pro 


Thr 










80 




85 


90 


Pro 


Ala 


Ala 


Ala 


Phe Arg 


Ser 


Pro Leu Ala Arg Gin Leu Phe 


Arg 










95 




100 


105 


He 


Glu 


Gly 


Val 


Lys Ser 


Val 


Phe Phe Gly Pro Asp Phe He 


Thr 










110 




115 


120 


Val 


Thr 


Lys 


Glu 


Asn Glu 


Glu 


Leu Asp Trp Asn Leu Leu Lys 


Pro 










125 




13 0 


135 


Asp 


He 


Tyr 


Ala 


Thr He 


Met 


Asp Phe Phe Ala Ser Gly Leu 


Pro 
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140 145 150 
Leu Val Thr Glu Glu Thr Pro Ser Gly Glu Ala Gly Ser Glu Glu 

155 160 " 165 

Asp Asp Glu Val Val Ala Met lie Lys Glu Leu Leu Asp Thr Arg 

170 175 " 180 

He Arg Pro Thr Val Gin Glu Asp Gly Gly Asp Val He Tyr Lys 

185 190 195 

Gly Phe Glu Asp Gly He Val Gin Leu Lys Leu Gin Gly Ser Cys 

200 205 210 

Thr Ser Cys Pro Ser Ser He lie. Thr Leu Lys Asn Gly He Gin 

215 220 225 

Asn Met Leu Gin Phe Tyr He Pro Glu Val Glu Gly Val Glu Gin 

230 235 240 

Val Met Asp Asp Glu Ser Asp Glu Lys Glu Ala Asn Ser Pro 

245 250 



<210> 3 

<211> 555 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1290078CD1 

<400> 3 

Met Gin Ser Cys Glu Ser Ser Gly Asp Ser Ala Asp Asp Pro Leu 
1 5 10 15 

Ser Arg Gly Leu Arg Arg Arg Gly Gin Pro Arg Val Val Val He 
20 25 30 

Gly Ala Gly Leu^Ala Gly Leu Ala Ala Ala Lys Ala Leu Leu Glu 
35 40 45 

Gin Gly Phe Thr Asp Val Thr Val Leu Glu Ala Ser Ser His He 
50 55 60 

Gly Gly Arg Val Gin Ser Val Lys Leu Gly His Ala Thr Phe Glu 
65 70 75 

Leu Gly Ala Thr Trp He His Gly Ser His Gly Asn Pro He Tyr 
80 85 90 

His Leu Ala Glu Ala Asn Gly Leu Leu Glu Glu Thr Thr Asp Gly 
95 100 105 

Glu Arg Ser Val Gly Arg He Ser Leu Tyr Ser Lys Asn Gly Val 

HO 115 120 

Ala Cys Tyr Leu Thr Asn His Gly Arg Arg He Pro Lys Asp Val 

125 130 135 

Val Glu Glu Phe Ser Asp Leu Tyr Asn Glu Val Tyr Asn Leu Thr 

140 145 150 

Gin Glu Phe Phe Arg His Asp Lys Pro Val Asn Ala Glu Ser Gin 

155 160 165 

Asn Ser Val Gly Val Phe Thr Arg Glu Glu Val Arg Asn Arg He 

170 175 180 

Arg Asn Asp Pro Asp Asp Pro Glu Ala Thr Lys Arg Leu Lys Leu 

185 190 195 

Ala Met He Gin Gin Tyr Leu Lys Val Glu Ser Cys Glu Ser Ser 

200 205 210 

Ser His Ser Met Asp Glu Val Ser Leu Ser Ala Phe Gly Glu Trp 

215 220 225 

Thr Glu He Pro Gly Ala His His He He Pro Ser Gly Phe Met 

230 235 240 

Arg Val Val Glu Leu Leu Ala Glu Gly He Pro Ala His Val He 

245 250 255 

Gin Leu Gly Lys Pro Val Arg Cys He His Trp Asp Gin Ala Ser 

260 265 270 

Ala Arg Pro Arg Gly Pro Glu He Glu Pro Arg Gly Glu Gly Asp 

275 280 ^ 285 
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His 


Asn 


His Asp 


Thr 


Gly 


Glu Gly Gly Gin Gly Gly Glu Glu Pro 








290 








295 


inn 


Arg 


Gly 


Gly Arg 


Trp 


Asp 


Glu Asp Glu Gin 


irp s>er vai vai vai. 








305 








310 


j id 


Glu 


Cys 


Glu Asp 


cys 


Glu 


Leu He 


Pro 


Ala 


Asp mis vai ne vai 








320 








325 


*J "3 f\ 


Thr 


Val 


Ser Leu 


Gly Val 


Leu Lys Arg Gin 


iyx inr ser riie rne 








335 








340 




Arg 


Pro 


Gly Leu 


Pro 


Thr 


Glu Lys Val 


Ala 


Ala lie rilS Arg jjeu 








350 








355 


7 £ f\ 


Gly 


He 


Gly Thr 


Thr 


Asp 


Lys He 


Phe 


Leu 


vjiu Fne biu tsiu irro 








365 








370 


J / D 


Phe 


Tip 


Gly Pro 


Glu 


Cys 


Asn Ser 


Leu 


Gin 


jrne vai ixp giu Asp 








380 








385 


"300 
J7U 


Glu 


Ala 


Glu Ser 


His 


Thr 

i. Hi- 


Leu Thr Tyr 


Pro 


riO Vj±U bcU 1 1 p iyr 








395 








400 


3 UD 


Arg 


Lys 


He Cys 


Gly 


Phe 


Asp Val 


Leu 




riO riO G1U ATy iyr 








410 








415 


,4 o rv 


Gly 


His 


Val Leu 


Ser 


Gly 


Trp He 


Cys 




bill blU nla L6U Val 








425 








H. J V/ 


*± J J 


Met 


GlU 


Lys Cys 


Asp 


Asp 


Glu Ala 


Val 


Aid 


111 Tl P* r em rpVx -v- P It* 

v»iu lie uys inr giu 








440 








445 




Met 


Leu 


Arg Gin 


Phe 


Thr 


Gly Asn 


Pro 


Asn 


lie irro jjys Fro Arg 








455 








460 




Arg 


He 


Leu Arg 




Ala 


Trp Gly 


Ser 


Asn 


rTo iyr rue Arg Giy 








470 








475 




Ser 


Tyr 


Ser Tyr 


Thr 


Gin 


Val Gly 


Ser 


Ser 


Giy Ala Asp vai GIU 








485 








490 


495 


Lys 


Leu 


Ala Lys 


Pro 


Leu 


Pro Tyr 


Thr 


Glu 


Ser Ser Lys Thr Ala 








500 








505 


510 


Pro 


Met 


Gin Val 


Leu 


Phe 


Ser Gly Glu Ala Thr His Arg Lys Tyr 








515 








520 


525 


Tyr 


Ser 


Thr Thr 


His 


Gly 


Ala Leu Leu Ser Gly Gin Arg Glu Ala 








530 








535 


540 


Ala 


Arg 


Leu He 


Glu 


Met 


Tyr Arg Asp Leu Phe Gin Gin Gly Thr 








545 








550 


555 



<210> 4 

<211> 337 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1302741CD1 

<400> 4 



Met 


Ala 


Leu 


Gin 


Thr 


Leu 


Gin 


Ser Ser Trp Val Thr Phe Arg Lys 


1 








5 






10 


15 


He 


Leu 


Ser 


His 


Phe 


Pro 


Glu 


Glu Leu Ser 


Leu Ala Phe Val Tyr 










20 






25 


30 


Gly 


Ser 


Gly 


Val 


Tyr 


Arg 


Gin 


Ala Gly Pro 


Ser Ser Asp Gin Lys 










35 






40 


45 


Asn 


Ala 


Met 


Leu 


Asp 


Phe 


Val 


Phe Thr Val 


Asp Asp Pro Val Ala 










50 






55 


60 


Trp 


His 


Ser 


Lys 


Asn 


Leu 


Lys 


Lys Asn Trp Ser His Tyr Ser Phe 










65 






70 


75 


Leu 


Lys 


Val 


Leu 


Gly Pro 


Lys 


He He Thr 


Ser He Gin Asn Asn 










80 






85 


90 


Tyr 


Gly 


Ala 


Gly 


Val 


Tyr 


Tyr 


Asn Ser Leu 


He Met Cys Asn Gly 










95 






100 


105 


Arg 


Leu 


He 


Lys 


Tyr 


Gly 


Val 


He Ser Thr 


Asn Val Leu lie Glu 










110 






115 


120 


Asp 


Leu 


Leu 


Asn 


Trp 


Asn 


Asn 


Leu Tyr He 


Ala Gly Arg Leu Gin 
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130 135 
1 Asn Glu Asp Val Thr Leu Arg 
145 150 
s Ser Ala Val Thr Ala Ala Phe 
160 165 
r Glu Glu Asp Leu Phe He Glu 
175 180 
/ Asp Phe Arg Met Val Val Gly 
190 195 
a He Val Lys Pro Asn He Ala 
205 210 
r He Leu Gin Glu Asn Pro Gin 
220 225 
/ Trp Leu Glu He Asp Lys Ser 
235 240 
i Met Thr Leu Pro Lys Thr Leu 
250 255 
t Asp Pro Pro Gly Lys Asn Arg 
265 270 
i Val Ala His Asp Pro Asp Cys 
280 285 
x Ser Ala He Val Arg Pro Ser 
295 300 
f He Phe Thr Ala Gly Leu Lys 
310 315 
l Lys Leu His Lys Met Trp Lys 
325 330 







125 






Lys Pro 


Val Lys 


He 


He 


Ser 






140 






Ser Ala 


Leu Asp 


Arg Asn 


Leu 






155 






Leu Met 


Leu Pro 


Glu 


Ser 


Phe 






170 






He Ala 


Gly Leu 


Ser 


Tyr 


Ser 






185 






Glu Asp 


Lys Thr 


Lys 


Val 


Leu 






200 






His Phe 


Arg Glu 


Leu Tyr Gly 






215 






Val Val 


Tyr Lys 


Ser 


Gin 


Gin 






230 






Pro Glu 


Gly Gin 


Phe 


Thr 


Gin 






245 






Gin Gin 


Gin He 


Asn 


His 


He 






260 






Asp Val 


Glu Glu 


Thr 


Leu 


Phe 






275 






Gly Asp 


Val Val 


Arg Leu Gly 






290 






Ser He 


Arg Gin 


Ser 


Thr Lys 






305 






Lys Ser 


Val He 


Tyr 


Ser 


Ser 






320 






Gly Trp 


Leu Arg 


Lys 


Thr 


Ser 






335 







<210> 5 

<211> 109 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1541028CD1 

<400> 5 

Met Ser Ala Asn Thr Phe Gly Asn Ala Gly Phe Ser Val Leu Leu 
1 5 10 is 

Pro Gly Ala Arg Leu Glu Gly Arg Cys Gly Pro Thr Asn Ala Arg 

20 25 30 

Val Arg Cys His Leu Gly Leu Lys He Pro Pro Gly Cys Glu Leu 

35 40 45 

Val Val Gly Gly Glu Pro Gin Cys Trp Ala Glu Gly His Cys Leu 

50 55 60 

Leu Val Asp Asp Ser Phe Leu His Thr Val Ala His Asn Gly Ser 

65 70 75 

Pro Glu Asp Gly Pro Arg Val Val Phe He Val Asp Leu Trp His 

80 85 90 

Pro Asn Val Ala Gly Ala Glu Arg Gin Ala Leu Asp Phe Val Phe 

95 100 105 

Ala Pro Asp Pro 



<210> 6 
<211> 385 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
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<223> Incyte ID No: 1597687CD1 
<400> 6 

Met Lys Met Leu Pro Gly Val Gly Val Phe Gly Thr Gly Ser Ser 
1 5 10 15 

Ala Arg Val Leu Val Pro Leu Leu Arg Ala Glu Gly Phe Thr Val 
20 25 30 

Glu Ala Leu Trp Gly Lys Thr Glu Glu Glu Ala Lys Gin Leu Ala 
35 40 * 45 

Glu Glu Met Asn He Ala Phe Tyr Thr Ser Arg Thr Asp Asp He 
50 55 ~ 60 

Leu Leu His Gin Asp Val Asp Leu Val Cys He Ser He Pro Pro 
65 70 75 

Pro Leu Thr Arg Gin He Ser Val Lys Ala Leu Gly He Gly Lys 
80 85 ^ 90 

Asn Val Val Cys Glu Lys Ala Ala Thr Ser Val Asp Ala Phe Arg 
95 100 ^ 105 

Met Val Thr Ala Ser Arg Tyr Tyr Pro Gin Leu Met Ser Leu Val 
HO 115 120 

Gly Asn Val Leu Arg Phe Leu Pro Ala Phe Val Arg Met Lys Gin 
125 130 135 

Leu He Ser Glu His Tyr Val Gly Ala Val Met He Cys Asp Ala 
140 145 ^ 150 

Arg He Tyr Ser Gly Ser Leu Leu Ser Pro Ser Tyr Gly Trp He 

155 160 * 165 

Cys Asp Glu Leu Met Gly Gly Gly Gly Leu His Thr Met Gly Thr 

170 175 180 

Tyr He Val Asp Leu Leu Thr His Leu Thr Gly Arg Arg Ala Glu 

185 190 195 

Lys Val His Gly Leu Leu Lys Thr Phe Val Arg Gin Asn Ala Ala 

200 205 210 

He Arg Gly lie Arg His Val Thr Ser Asp Asp Phe Cys Phe Phe 

215 220 ' 225 

Gin Met Leu Met Gly Gly Gly Val Cys Ser Thr Val Thr Leu Asn 

230 235 240 

Phe Asn Met Pro Gly Ala Phe Val His Glu Val Met Val Val Gly 

245 250 255 

Ser Ala Gly Arg Leu Val Ala Arg Gly Ala Asp Leu Tyr Gly Gin 

260 265 270 

Lys Asn Ser Ala Thr Gin. Glu Glu Leu Leu Leu Arg Asp Ser Leu 

275 280 285 

Ala Val Gly Ala Gly Leu Pro Glu Gin Gly Pro Gin Asp Val Pro 

290 295 300 

Leu Leu Tyr Leu Lys Gly Met Val Tyr Met Val Gin Ala Leu Arg 

305 310 315 

Gin Ser Phe Gin Gly Gin Gly Asp Arg Arg Thr Trp Asp Arg Thr 

320 325 330 

Pro Val Ser Met Ala Ala Ser Phe Glu Asp Gly Leu Tyr Met Gin 

335 340 * 345 

Ser Val Val Asp Ala He Lys Arg Ser Ser Arg Ser Gly Glu Trp 

350 355 360 

Glu Ala Val Glu Val Leu Thr Glu Glu Pro Asp Thr Asn Gin Asn 

365 370 375 

Leu Cys Glu Ala Leu Gin Arg Asn Asn Leu 

380 385 



<210> 7 

<211> 312 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1690348CD1 
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<400> 7 

Met Ala Cys Ala Ala Val Met 

1 5 
Val Gly Ala He Arg He Glu 
20 

Ser Thr Leu Arg His Leu Thr 
35 

Arg Lys Thr Asp His Met Glu 
50 

Glu He Val Ala Ala Ala Lys 
65 

Pro Ser Val Lys Ser Leu Arg 
80 

Ser Phe Lys Ala Gly Gin Trp 
95 

Ser Val Val Gly Gly Phe Ser 
110 

Glu Gin Glu Arg Val He Glu 
125 

Pro Pro Ala Leu Trp Val His 
140 

Val Ala Val Arg Val Gly Gly 
155 

Ala Asp Ala Ser Arg Asn Leu 
170 

He Asn Pro Leu Leu Ser He 
185 

Arg Glu Gin Ala Asn Lys Arg 
200 

Lys Leu Phe Tyr Ser Ala Lys 
215 

Lys Asn He Leu Asp Leu Val 
230 

Cys Ser Leu His Val Thr Lys 
245 

Leu Lys Pro Tyr He Thr Glu 
260 

Arg Asp His He Ser Lys Glu 
275 

Pro Pro Met Thr Asp Phe Phe 

* 290 

Val Pro Lys Glu His He Cys 
305 



lie 


Pro 


c>iy 


Leu Leu Arg Cys Ser 






1U 


15 


Ala 
Aia 


Aia 


ber 


Leu Arg Leu Thr Leu 






25 




Leu 


Thr 


ser 


lie Met Lys ser Lys 






40 


4b 


Arg Thr Ala 


>er vai Leu Arg Arg 






5b 


60 


vai 


Cys 


Gly 


Ala Ala Ser Glu Ser 






70 


75 


Leu 


Leu 


Val 


Ala Asp Gin Asp Phe 






85 


90 


vai 


Asp 


Pne 


Fne lie Fro Gly Val 






100 


105 


He 


cys 


Ser 


Ser Pro Arg Leu Leu 






115 


120 


Leu 


Ala 


Val 


Lys Tyr Thr Asn His 






130 


135 


Asn 


Thr 


Cys 


Thr Leu Asp Cys Glu 






145 


ICO 

150 


Glu 


Phe 


Phe 


Phe Asp Pro Gin Pro 






160 


165 


Val 


Leu 


He 


Ala Gly Gly Val Gly 






175 


180 


Leu Arg 


His 


Ala Ala Asp Leu Leu 






190 


195 


Asn Gly Tyr 


Glu He Gly Thr He 






205 


210 


Asn 


Thr 


Ser 


Glu Leu Leu Phe Lys 






220 


225 


Asn 


Glu 


Phe 


Pro Glu Lys He Ala 






235 


240 


Gin 


Thr 


Thr 


Gin lie Asn Ala Glu 






250 


255 


Gly Arg 


He 


Thr Glu Lys Glu He 






265 


270 


Thr 


Leu 


Phe 


Tyr He Cys Gly Pro 






280 


285 


Ser 


Lys 


Gin 


Leu Glu Asn Asn His 






295 


300 


Phe 


Glu 


Lys 


Trp Trp 






310 





<210> 8 

<211> 160 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1865603CD1 

<400> 8 

Met Leu Arg His Leu Pro Ser Arg Leu Pro Val Lys Met Trp Gly 
15 10 15 

Arg Thr Leu Glu Lys Gin Ser Trp Arg Asp Ser Ser Gin Thr Pro 

20 25 30 

Pro Pro Cys Leu He Arg Arg Leu Asp His He Val Met Thr Val 

35 40 45 

Lys Ser He Lys Asp Thr Thr Met Phe Tyr Ser Lys He Leu Gly 

50 55 " 60 

Met Glu Val Met Thr Phe Lys Glu Asp Arg Lys Ala Leu Cys Phe 

65 70 75 
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Gly Asp Gin Lys Phe Asn Leu His Glu Val Gly Lys Glu Phe Glu 

80 85 90 

Pro Lys Ala Ala His Pro Val Pro Gly Ser Leu Asp He Cys Leu 

95 100 ^ 105 

He Thr Glu Val Pro Leu Glu Glu Met He Gin His Leu Lys Ala 

110 115 120 

Cys Asp Val Pro He Glu Glu Gly Pro Val Pro Arg Thr Gly Ala 

125 130 ~ 135 

Lys Gly Pro He Met Ser He Tyr Phe Arg Asp Pro Asp Arg Asn 

140 145 150 

Leu He Glu Val Ser Asn Tyr He Ser Ser 

155 160 



<210> 9 

<211> 487 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1976472CD1 

<400> 9 

Met Asp Trp He Arg Leu Thr Lys Ser Gly Lys Asp Leu Thr Gly 
1 5 10 15 

Leu Lys Gly Arg Leu He Glu Val Thr Glu Glu Glu Leu Lys Lys 
20 25 ~ 30 

His Asn Lys Lys Asp Asp Cys Trp He Cys He Arg Gly Phe Val 
35 40 " 45 

Tyr Asn Val Ser Pro Tyr Met Glu Tyr His Pro Gly Gly Glu Asp 
50 55 60 

Glu Leu Met Arg Ala Ala Gly Ser Asp Gly Thr Glu Leu Phe Asp 
65 70 75 

Gin Val His Arg Trp Val Asn Tyr Glu Ser Met Leu Lys Glu Cys 
80 85 90 

Leu Val Gly Arg Met Ala He Lys Pro Ala Val Leu Lys Asp Tyr 
95 100 105 

Arg Glu Glu Glu Lys Lys Val Leu Asn Gly Met Leu Pro Lys Ser 

110 115 120 

Gin Val Thr Asp Thr Leu Ala Lys Glu Gly Pro Ser Tyr Pro Ser 

125 130 135 

Tyr Asp Trp Phe Gin Thr Asp Ser Leu Val Thr He Ala He Tyr 

140 145 150 

Thr Lys Gin Lys Asp He Asn Leu Asp Ser He He Val Asp His 

155 160 165 

Gin Asn Asp Ser Phe Arg Ala Glu Thr He He Lys Asp Cys Leu 

170 175 180 

Tyr Leu He His lie Gly Leu Ser His Glu Val Gin Glu Asp Phe 

185 190 195 

Ser Val Arg Val Val Glu Ser Val Gly Lys He Glu He Val Leu 

200 205 210 

Gin Lys Lys Glu Asn Thr Ser Trp Asp Phe Leu Gly His Pro Leu 

215 220 225 

Lys Asn His Asn Ser Leu He Pro Arg Lys Asp Thr Gly Leu Tyr 

230 235 240 

Tyr Arg Lys Cys Gin Leu He Ser Lys Glu Asp Val Thr His Asp 

245 250 255 

Thr Arg Leu Phe Cys Leu Met Leu Pro Pro Ser Thr His Leu Gin 

260 265 270 

Val Pro He Gly Gin His Val Tyr Leu Lys Leu Pro He Thr Gly 

275 280 285 

Thr Glu He Val Lys Pro Tyr Thr Pro Val Ser Gly Ser Leu Leu 

290 295 300 

Ser Glu Phe Lys Glu Pro Val Leu Pro Asn Asn Lys Tyr He Tyr 
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305 


Pne 


Leu 


He Lys 


He Tyr Pro 








320 


Asp 


Arg 


Leu Gin 


He Gly Asp 








335 


Gly 


Asn 


Phe Lys 


He Ser Lys 








350 


Leu 


Leu 


Ala Ala 


Gly Thr Gly 








365 


Asn 


Tyr 


Ala Leu 


Thr Asp He 








380 


Met 


Phe 


Phe Asn 


Lys Thr Glu 








395 


Leu 


Glu 


Lys Leu 


Ala Phe Lys 








410 


Val 


Leu 


Ser Ala 


Pro He Ser 








425 


lie 


Ser 


Pro Ala 


Leu Leu Ser 








440 


Lys 


Ser 


Lys Val 


Leu Val Cys 








455 


Glu 


Gin 


Gly Val 


Arg Leu Leu 








470 


Glu 


He 


His Ser 


Phe Thr Ala 








485 



310 




315 


Thr Gly Leu Phe Thr Pro 


Glu 


Leu 


325 




330 


Phe Val Ser Val Ser Ser 


Pro 


Glu 


340 




345 


Phe Gin Glu Leu Glu Asp 


Leu 


Phe 


355 




360 


Phe Thr Pro Met Val Lys 


lie 


Leu 


370 




375 


Pro Ser Leu Arg Lys Val 


Lys 


Leu 


385 


390 


Asp Asp He lie Trp Arg 


Ser 


Gin 


400 




405 


Asp Lys Arg Leu Asp Val 


Glu 


Phe 


415 




420 


Glu Trp Asn Gly Lys Gin 


Gly His 


430 




435 


Glu Phe Leu Lys Arg Asn 


Leu Asp 


445 




450 


lie Cys Gly Pro Val Pro 


Phe 


Thr 


460 




465 


His Asp Leu Asn Phe Ser 


Lys 


Asn 


475 


480 



<210> 10 

<211> 524 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2050821CD1 

<400> 10 

Met Ser Leu Leu Ser Leu Pro Trp Leu Gly Leu Arg Pro Val Ala 
15 10 15 

Met Ser Pro Trp Leu Leu Leu Leu Leu Val Val Gly Ser Trp Leu 
20 25 " 30 

Leu Ala Arg lie Leu Ala Trp Thr Tyr Ala Phe Tyr Asn Asn Cys 
35 40 45 

Arg Arg Leu Gin Cys Phe Pro Gin Pro Pro Lys Arg Asn Trp Phe 
50 55 60 

Trp Gly His Leu Gly Leu He Thr Pro Thr Glu Glu Gly Leu Lys 
65 70 ' 75 

Asp Ser Thr Gin Met Ser Ala Thr Tyr Ser Gin Gly Phe Thr Val 
80 85 90 

Trp Leu Gly Pro lie lie Pro Phe He Val Leu Cys His Pro Asp 
95 100 105 

Thr He Arg Ser lie Thr Asn Ala Ser Ala Ala lie Ala Pro Lys 
110 115 120 

Asp Asn Leu Phe lie Arg Phe Leu Lys Pro Trp Leu Gly Glu Gly 
125 130 135 

lie Leu Leu Ser Gly Gly Asp Lys Trp Ser Arg His Arg Arg Met 
140 145 ~ 150 

Leu Thr Pro Ala Phe His Phe Asn lie Leu Lys Ser Tyr He Thr 
155 160 165 

lie Phe Asn Lys Ser Ala Asn lie Met Leu Asp Lys Trp Gin His 
170 175 *" 180 

Leu Ala Ser Glu Gly Ser Ser Arg Leu Asp Met Phe Glu His lie 
185 190 195 

Ser Leu Met Thr Leu Asp Ser Leu Gin Lys Cys He Phe Ser Phe 
200 205 210 
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Asp ser His Cys Gin Glu Arg Pro Ser Glu Tyr He Ala Thr lie 

215 220 225 

Leu Glu Leu Ser Ala Leu Val Glu Lys Arg Ser Gin His He Leu 

230 235 240 

Gin His Met Asp Phe Leu Tyr Tyr Leu Ser His Asp Gly Arg Ara 

245 250 255 

Phe His Arg Ala Cys Arg Leu Val His Asp Phe Thr Asp Ala Val 
. 260 265 270 

He Arg Glu Arg Arg Arg Thr Leu Pro Thr Gin Gly He Asp Asp 

275 280 285 

Phe Phe Lys Asp Lys Ala Lys Ser Lys Thr Leu Asp Phe He Asp 

290 295 300 

Val Leu Leu Leu Ser Lys Asp Glu Asp Gly Lys Ala Leu Ser Asp 

305 310 315 

Glu Asp He Arg Ala Glu Ala Asp Thr Phe Met Phe Gly Gly His 

320 325 330 

Asp Thr Thr Ala Ser Gly Leu Ser Trp Val Leu Tyr Asn Leu Ala 

335 340 345 

Arg His Pro Glu Tyr Gin Glu Arg Cys Arg Gin Glu Val Gin Glu 
_ r 350 355 360 

Leu Leu Lys Asp Arg Asp Pro Lys Glu He Glu Trp Asp Asp Leu 

365 370 * 375 

Ala Gin Leu Pro Phe Leu Thr Met Cys Val Lys Glu Ser Leu Arg 

380 385 390 

Leu His Pro Pro Ala Pro Phe He Ser Arg Cys Cys Thr Gin Asp 

395 400 * 405 

He Val Leu Pro Asp Gly Arg Val He Pro Lys Gly He Thr Cys 

410 415 ~ 420 

Leu He Asp He He Gly Val His His Asn Pro Thr Val Trp Pro 

425 430 435 

Asp Pro Glu Val Tyr Asp Pro Phe Arg Phe Asp Pro Glu Asn Ser 

440 445 450 

Lys Gly Arg Ser Pro Leu Ala Phe He Pro Phe Ser Ala Gly Pro 

455 460 465 

Arg Asn Cys He Gly Gin Ala Phe Ala Met Ala Glu Met Lys Val 

470 475 480 

Val Leu Ala Leu Met Leu Leu His Phe Arg Phe Leu Pro Asp His 

485 490 495 

Thr Glu Pro Arg Arg Lys Leu Glu Leu He Met Arg Ala Glu Gly 

500 505 510 

Gly Leu Trp Leu Arg Val Glu Pro Leu Asn Val Gly Leu Gin 

515 520 



<210> 11 

<211> 144 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2408443CD1 

<400> 11 

Met Val Thr Leu Tyr Ser Ser Pro Ser Cys Thr Ser Cys Arg Lys 
1 5 10 is 

Ala Lys Gin Trp Leu Val Asp His Asn Leu Pro Phe He Glu Arg 

20 25 30 

Asn Leu Asn Lys Glu Pro Leu Arg Ala Glu Asp Val Lys Ala Met 

35 40 45 

Leu Arg Leu Thr Glu Asp Gly Thr Glu Glu Leu He Ser Thr Ara 

50 55 60 

Ser Lys He Phe Ser Glu Leu Thr He Asp Leu Asp Asp Met Ser 

65 70 75 

He Asn Lys Leu He Asp Leu He Val Met Tyr Pro Ser Leu Leu 
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80 85 90 

Lys Arg Pro lie lie Leu Asp Asp Gin Arg Met Gin He Gly Tyr 

95 100 105 

Asn Asp Asp Glu He Arg Arg Phe Leu Pro Arg Glu Val Arg Gin 

110 115 120 

Arg Glu Leu He Arg Ala Thr Phe Lys Ala Asp Phe Ala Glu Glu 

125 130 135 
Ala Lys Asp Leu Val Val Glu Glu Gly 
140 



<210> 12 

<211> 373 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2508668CD1 

<400> 12 

Met Trp Val Cys Ser Thr Leu Trp Arg Val Arg Thr Pro Ala Arg 
1 5 10 15 

Gin Trp Arg Gly Leu Leu Pro Ala Ser Gly Cys His Gly Pro Ala 
20 25 ~ 3 0 

Ala Ser Ser Tyr Ser Ala Ser Ala Glu Pro Ala Arg Val Arg Ala 
35 40 45 

Leu Val Tyr Gly His His Gly Asp Pro Ala Lys Val Val Glu Leu 
50 55 60 

Lys Asn Leu Glu Leu Ala Ala Val Arg Gly Ser Asp Val Arg Val 
65 70 75 

Lys Met Leu Ala Ala Pro He Asn Pro Ser Asp He Asn Met He 
80 85 90 

Gin Gly Asn Tyr Gly Leu Leu Pro Glu Leu Pro Ala Val Gly Gly 
95 100 105 

Asn Glu Gly Val Ala Gin Val Val Ala Val Gly Ser Asn Val Thr 

110 115 120 

Gly Leu Lys Pro Gly Asp Trp Val He Pro Ala Asn Ala Gly Leu 

125 130 135 

Gly Thr Trp Arg Thr Glu Ala Val Phe Ser Glu Glu Ala Leu He 

140 145 150 

Gin Val Pro Ser Asp He Pro Leu Gin Ser Ala Ala Thr Leu Gly 

155 160 165 

Val Asn Pro Cys Thr Ala Tyr Arg Met Leu Met Asp Phe Glu Gin 

170 175 180 

Leu Gin Pro Gly Asp Ser Val He Gin Asn Ala Ser Asn Ser Gly 

185 190 195 

Val Gly Gin Ala Val He Gin He Ala Ala Ala Leu Gly Leu Arg 

200 205 " 210 

Thr He Asn Val Val Arg Asp Arg Pro Asp He Gin Lys Leu Ser 

215 220 225 

Asp Arg Leu Lys Ser Leu Gly Ala Glu His Val He Thr Glu Glu 

230 235 240 

Glu Leu Arg Arg Pro Glu Met Lys Asn Phe Phe Lys Asp Met Pro 

245 250 & 255 

Gin Pro Arg Leu Ala Leu Asn Cys Val Gly Gly Lys Ser Ser Thr 

260 265 270 

Glu Leu Leu Arg Gin Leu Ala Arg Gly Gly Thr Met Val Thr Tyr 

275 280 285 

Gly Gly Met Ala Lys Gin Pro Val Val Ala Ser Val Ser Leu Leu 

290 295 300 

He Phe Lys Asp Leu Lys Leu Arg Gly Phe Trp Leu Ser Gin Trp 

305 310 315 

Lys Lys Asp His Ser Pro Asp Gin Phe Lys Glu Leu He Leu Thr 

320 325 330 
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» 

Leu Cys Asp Leu He Arg Arg Gly Gin Leu Thr Ala Pro Ala Cys 
335 340 345 

Ser Gin Val Pro Leu Gin Asp Tyr Gin Ser Ala Leu Glu Ala Ser 
350 355 360 

Met Lys Pro Phe He Ser Ser Lys Gin He Leu Thr Met 
365 370 



<210> 13 

<211> 305 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2536830CD1 

<400> 13 



Met Ala 


Lys 


Phe 


Val He Ala 


Gly Arg 


Ala 


Asp 


Cys Pro Tyr 


Tyr 


1 






5 




10 




15 


Ala Lys 


Thr 


Glu 


Leu Val Ala 


Asp Tyr 


Leu 


Gin 


Lys Asn Leu 


Pro 








20 




25 




30 


Asp Phe Arg 


r Ile 


His Lys He 


Thr Gin Arg 


Pro 


Glu Val Trp 


Glu 








35 




40 




45 


Asp Trp 


Leu 


Lys 


Asp Val Cys 


Glu Lys 


Asn 


Lys 


Trp Ser His 


Lys 








50 




55 




60 


Asn Ser 


Pro 


He 


He Trp Arg 


Glu Leu 


Leu 


Asp 


Arg Gly Gly Lys 








65 




70 






75 


Gly Leu Leu 


Leu 


Gly Gly Tyr 


Asn Glu 


Phe 


Leu 


Glu His Ala 


Gin 








80 




85 






90 


Leu Tyr 


Tyr 


Asp 


Val Thr Ser 


Ser Met 


Thr 


Thr 


Glu Leu Met 


Met 


Val He 






95 




100 






105 


Ala 


Gin 


Glu Asn Leu 


Gly Ala His 


He 


Glu Lys Glu Gin 








110 




115 






120 


Glu Glu 


Glu 


Ala 


Leu Lys Thr 
125 


Cys He 


Asn 
130 


Pro 


Leu Gin Val 


Trp 
135 


He Thr 


Ser 


Ala 


Ser Ala Pro 


Ala Cys Tyr Asn 


Leu He Pro 


He 








140 




145 






150 


Leu Thr 


Ser 


Gly Glu Val Phe 


Gly Met His 


Thr 


Glu He Ser 


He 


Thr Leu 






155 




160 






165 


Phe 


Asp Asn Lys Gin 


Ala Glu 


Glu 


His 


Leu Lys Ser 


Leu 


Val Val 






170 




175 




180 


Glu 


Thr Gin Asp Leu 


Ala Ser 


Pro 


Val 


Leu Arg Ser 


Val 








185 




190 




195 


Ser He 


Cys 


Thr 


Lys Val Glu 


Glu Ala 


Phe 


Arg 


Gin Ala His 


Val 


He Val 






200 




205 




210 


Val 


Leu Asp Asp Ser 


Thr Asn 


Lys 


Glu 


Val Phe Thr 


Leu 








215 




220 






225 


Glu Asp Cys 


Leu Arg Ser Arg 


Val Pro 


Leu 


Cys 


Arg Leu Tyr Gly 








230 




235 






240 


Tyr Leu 


He 


Glu 


Lys Asn Ala 


His Glu 


Ser 


Val 


Arg Val He 


Val 








245 




250 




255 


Gly Gly Arg 


Thr 


Phe Val Asn 


Leu Lys 


Thr 


Val 


Leu Leu Met 


Arg 








260 




265 






270 


Tyr Ala Pro 


Arg 


He Ala His 


Asn He 


He 


Ala 


Val Ala Leu Gly 








275 




280 






285 


Val Glu Gly 


Glu 


Ala Lys Ala 


He Leu Ala Arg 


Lys Leu Lys 


Thr 








290 




295 




300 


Ala Pro 


Ser 


Cys 


Glu 
















305 













<210> 14 
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<211> 500 
<212> PRT 
<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2645179CD1 

<400> 14 

Met Glu Ala Ala Arg Pro Pro Pro Thr Ala Gly Lys Phe Val Val 
1 5 10 15 

Val Gly Gly Gly He Ala Gly Val Thr Cys Ala Glu Gin Leu Ala 
20 25 30 

Thr His Phe Pro Ser Glu Asp He Leu Leu Val Thr Ala Ser Pro 
35 40 45 

Val He Lys Ala Val Thr Asn Phe Lys Gin He Ser Lys He Leu 
50 55 60 

Glu Glu Phe Asp Val Glu Glu Gin Ser Ser Thr Met Leu Gly Lys 
65 70 75 

Arg Phe Pro Asn He Lys Val He Glu Ser Gly Val Lys Gin Leu 
80 85 90 

Lys Ser Glu Glu His Cys He Val Thr Glu Asp Gly Asn Gin His 
95 100 105 

Val Tyr Lys Jjys Leu Cys Leu Cys Ala Gly Ala Lys Pro Lys Leu 
110 115 120 

He Cys Glu Gly Asn Pro Tyr Val Leu Gly He Arg Asp Thr Asp 
125 130 * 135 

Ser Ala Gin Glu Phe Gin Lys Gin Leu Thr Lys Ala Lys Arg He 
140 145 ~ 150 

Met He He Gly Asn Gly Gly He Ala Leu Glu Leu Val Tyr Glu 
155 160 165 

He Glu Gly Cys Glu Val He Trp Ala He Lys Asp Lys Ala He 
170 175 180 

Gly Asn Thr Phe Phe Asp Ala Gly Ala Ala Glu Phe Leu Thr Ser 
185 190 195 

Lys Leu He Ala Glu Lys Ser Glu Ala Lys He Ala His Lys Arg 
200 205 210 

Thr Arg Tyr Thr Thr Glu Gly Arg Lys Lys Glu Ala Arg Ser Lys 
215 220 225 

Ser Lys Ala Asp Asn Val Gly Ser Ala Leu Gly Pro Asp Trp His 
230 235 ^ 240 

Glu Gly Leu Asn Leu Lys Gly Thr Lys Glu Phe Ser His Lys He 
245 250 255 

His Leu Glu Thr Met Cys Glu Val Lys Lys He Tyr Leu Gin Asp 
260 265 ~ 270 

Glu Phe Arg He Leu Lys Lys Lys Ser Phe Thr Phe Pro Arg Asp 
275 280 285 

His Lys Ser Val Thr Ala Asp Thr Glu Met Trp Pro Val Tyr Val 
290 295 300 

Glu Leu Thr Asn Glu Lys He Tyr Gly Cys Asp Phe He Val Ser 
305 310 315 

Ala Thr Gly Val Thr Pro Asn Val Glu Pro Phe Leu His Gly Asn 
320 325 330 

Ser Phe Asp Leu Gly Glu Asp Gly Gly Leu Lys Val Asp Asp His 
335 340 345 

Met His Thr Ser Leu Pro Asp He Tyr Ala Ala Gly Asp He Cys 
350 355 360 

Thr Thr Ser Trp Gin Leu Ser Pro Val Trp Gin Gin Met Arg Leu 
365 370 375 

Trp Thr Gin Ala Arg Gin Met Gly Trp Tyr Ala Ala Lys Cys Met 
380 385 * 390 

Ala Ala Ala Ser Ser Gly Asp Ser He Asp Met Asp Phe Ser Phe 
395 400 405 

Glu Leu Phe Ala His Val Thr Lys Phe Phe Asn Tyr Lys Val Val 
410 415 420 

Leu Leu Gly Lys Tyr Asn Ala Gin Gly Leu Gly Ser Asp His Glu 
425 430 " 435 
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Leu Met Leu Arg Cys Thr Lys Gly Arg Glu Tyr lie Lys Val Val 

440 445 ~ 450 

Met Gin Asn Gly Arg Met Met Gly Ala Val Leu lie Gly Glu Thr 

455 460 * 465 

Asp Leu Glu Glu Thr Phe Glu Asn Leu He Leu Asn Gin Met Asn 

470 475 480 

Leu Ser Ser Tyr Gly Glu Asp Leu Leu Asp Pro Asn He Asp He 

485 490 495 
Glu Asp Tyr Phe Asp 

500 



<210> 15 

<211> 369 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2754425CD1 

<400> 15 

Met Val Trp Ala Pro Leu Gly Pro Pro Arg Thr Asp Cys Leu Thr 
1 5 10 * 15 

Leu Leu His Thr Pro Ser Lys Asp Ser Pro Lys Met Ser Leu Glu 
20 25 30 

Trp Leu Val Ala Trp Ser Trp Ser Leu Asp Gly Leu Arg Asp Cys 
35 40 45 

He Ala Thr Gly He Gin Ser Val Arg Asp Cys Asp Thr Thr Ala 
50 55 60 

Val He Thr Val Ala Cys Leu Leu Val Leu Phe Val Trp Tyr Cys 
65 70 " 75 

Tyr His Val Gly Arg Glu Gin Pro Arg Pro Tyr Val Ser Val Asn 
80 85 90 

Ser Leu Met Gin Ala Ala Asp Ala Asn Gly Leu Gin Asn Gly Tyr 
9S 100 105 

Val Tyr Cys Gin Ser Pro Glu Cys Val Arg Cys Thr His Asn Glu 

110 115 120 

Gly Leu Asn Gin Lys Leu Tyr His Asn Leu Gin Glu Tyr Ala Lys 

125 130 135 

Arg Tyr Ser Trp Ser Gly Met Gly Arg He His Lys Gly He Arg 

140 145 J 150 

Glu Gin Gly Arg Tyr Leu Asn Ser Arg Pro Ser He Gin Lys Pro 

155 160 165 

Glu Val Phe Phe Leu Pro Asp Leu Pro Thr Thr Pro Tyr Phe Ser 

170 175 " 180 

Arg Asp Ala Gin Lys His Asp Val Glu Val Leu Glu Arg Asn Phe 

185 190 195 

Gin Thr He Leu Cys Glu Phe Glu Thr Leu Tyr Lys Ala Phe Ser 

200 205 210 

Asn Cys Ser Leu Pro Gin Gly Trp Lys Met Asn Ser Thr Pro Ser 

215 220 225 

Gly Glu Trp Phe Thr Phe Tyr Leu Val Asn Gin Gly Val Cys Val 

230 235 240 

Pro Arg Asn Cys Arg Lys Cys Pro Arg Thr Tyr Arg Leu Leu Gly 

245 250 255 

Ser Leu Arg Thr Cys He Gly Asn Asn Val Phe Gly Asn Ala Cys 

260 265 270 

He Ser Val Leu Ser Pro Gly Thr Val He Thr Glu His Tyr Gly 

275 280 285 

Pro Thr Asn He Arg He Arg Cys His Leu Gly Leu Lys Thr Pro 

290 295 300 

Asn Gly Cys Glu Leu Val Val Gly Gly Glu Pro Gin Cys Trp Ala 

305 310 315 

Glu Gly Arg Cys Leu Leu Phe Asp Asp Ser Phe Leu His Ala Ala 
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320 325 330 

Phe His Glu Gly Ser Ala Glu Asp Gly Pro Arg Val Val Phe Met 

335 340 345 

Val Asp Leu Trp His Pro Asn Val Ala Ala Ala Glu Arg Gin Ala 

350 355 ~ 360 
Leu Asp Phe He Phe Ala Pro Gly Arg 

365 



<210> 16 
<211> 145 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2821526CD1 

<400> 16 

Met Glu Leu Val Gin Val Leu Lys Arg Gly Leu Gin Gin He Thr 

15 10 15 
Gly His Gly Gly Leu Arg Gly Tyr Leu Arg Val Phe Phe Arg Thr 

20 25 30 

Asn Asp Ala Lys Val Gly Thr Leu Val Gly Glu Asp Lys Tyr Gly 

35 40 " 45 

Asn Lys Tyr Tyr Glu Asp Asn Lys Gin Phe Phe Gly Arg His Ara 

50 55 60 

Trp Val Val Tyr Thr Thr Glu Met Asn Gly Lys Asn Thr Phe Trp 

65 70 75 

Asp Val Asp Gly Ser Met Val Pro Pro Glu Trp His Arg Trp Leu 

80 85 " 90 

His Ser Met Thr Asp Asp Pro Pro Thr Thr Lys Pro Leu Thr Ala 

95 100 105 

Arg Lys Phe He Trp Thr Asn His Lys Phe Asn Val Thr Gly Thr 

110 115 120 

Pro Glu Gin Tyr Val Pro Tyr Ser Thr Thr Arg Lys Lys He Gin 

125 130 ~ 135 

Glu Trp He Pro Pro Ser Thr Pro Tyr Lys 
140 145 



<210> 17 
<211> 255 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2876494CD1 

<400> 17 

ral Leu Ala Thr Ser Phe Val Leu Gly Ser Leu Gly Leu 

10 * 15 

1 Thr Thr Pro Lys Thr Leu Ala 

25 30 

a Val Gly Lys Val He He Asn 

40 45 

s Gly Leu Gly Tyr Lys Glu Glu 

55 - 60 

p Gly Val Arg Arg Lys Cys Gin 

70 ' 75 

r Asn Trp He Cys Gly Met Leu 

85 90 



Met 


Lys 


Val 


Leu 


Ala 


Thr Ser 


1 








5 




Ala 


Phe 


Tyr 


Leu 


Pro 


Leu Val 










20 




He 


Pro 


Glu 


Lys 


Leu 


Gin Glu 










35 




Ala 


Thr 


Thr 


Cys 


Thr 


Val Thr 










50 




Thr 


Val 


Cys 


Glu 


Val 


Gly Pro 










65 


Thr 


Gin 


Arg 


Leu 


Glu 


Cys Leu 










80 




His 


Phe 


Thr 


He 


Leu 


He Gly 
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95 100 105 

y Gin Glu Ala Phe Arg Phe Thr 
115 120 
e Ser Thr Asp Asp Glu Val Phe 
130 135 
s Phe Val Lys Phe Lys Tyr Ala 
145 iso 
r Arg Cys Asp Val Gin Leu Val 
160 165 
j Leu Tyr Phe Gly Leu Arg Val 
175 180 
a Asn Phe His Gin Ser Leu Thr 
190 195 
i Gly Leu Glu Val Asn Leu Asp 
205 210 
d Lys Trp Lys Lys Lys Val Ala 
220 225 
i lie Gly Val Val Gly Gly Val 
235 240 
i Leu Arg Gly Gly Leu Gin Gin 
250 ' 255 



Ser 


Ser Asp 


He 


Leu 


Glu 


Phe 








110 






Trp 


Arg Leu 


Ala 


Arg 


Gly 


Val 








125 






Lys 


Pro Phe 


Gin 


Ala 


Asn 


Ser 








140 






Gin Glu Tyr 


Asp 


Ser 


Gly 


Thr 








155 






Lys 


Asn Leu 


Arg 


Leu 


Val 


Lys 








170 




Leu 


Pro Pro 


Asn 


Leu 


Val 


Asn 








185 






Glu 


Asp Gin 


Lys 


Leu 


He 


Asp 








200 




Ser 


Tyr Ser 


Lys 


Pro 


His 


His 








215 






Ser 


Ala Leu 


Gly He Gly 


He 








230 






Leu 


Val Arg 


He 


Val 


Leu 


Cys 








245 





<210> 18 

<211> 246 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3403225CD1 

<400> 18 

Met Leu Val Thr Leu Gly Leu Leu Thr Ser Phe Phe Ser Phe Leu 
1 5 io 15 

Tyr Met Val Ala Pro Ser He Arg Lys Phe Phe Ala Gly Gly Val 
20 25 30 

Cys Arg Thr Asn Val Gin Leu Pro Gly Lys Val Val Val He Thr 
35 40 45 

Gly Ala Asn Thr Gly He Gly Lys Glu Thr Ala Arg Glu Leu Ala 
50 55 60 

Ser Arg Gly Ala Arg Val Tyr He Ala Cys Arg Asp Val Leu Lys 
65 70 75 

Gly Glu Ser Ala Ala Ser Glu He Arg Val Asp Thr Lys Asn Ser 
80 85 ~ 90 

Gin Val Leu Val Arg Lys Leu Asp Leu Ser Asp Thr Lys Ser He 
95 100 105 

Arg Ala Phe Ala Glu Gly Phe Leu Ala Glu Glu Lys Gin Leu His 
110 115 120 

He Leu He Asn Asn Ala Gly Val Met Met Cys Pro Tyr Ser Lys 
125 130 135 

Thr Ala Asp Gly Phe Glu Thr His Leu Gly Val Asn His Leu Gly 
^ 140 145 iso 

Thr Gly Val Thr Thr Tyr Ala Val His Pro Gly Val Val Arg Ser 
155 160 165 

Glu Leu Val Arg His Ser Ser Leu Leu Cys Leu Leu Trp Arg Leu 
170 175 180 

Phe Ser Pro Phe Val Lys Thr Ala Arg Glu Gly Ala Gin Thr Ser 
185 190 195 

Leu His Cys Ala Leu Ala Glu Gly Leu Glu Pro Leu Ser Gly Lys 
200 205 210 

Tyr Phe Ser Asp Cys Lys Arg Thr Trp Val Ser Pro Arg Ala Ara 
215 220 225 

Asn Asn Lys Thr Ala Glu Arg Leu Trp Asn Val Ser Cys Glu Leu 
230 235 240 
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Leu Gly He Arg Trp Glu 
245 



<210> 19 

<211> 467 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4163943CD1 

<400> 19 

Met Ala Ala Ala Ala Arg Ala Arg Val Ala Tyr Leu Leu Arg Gin 
15 10 15 

Leu Gin Arg Ala Ala Cys Gin Cys Pro Thr His Ser His Thr Tyr 
20 25 30 

Ser Gin Ala Pro Gly Leu Ser Pro Ser Gly Lys Thr Thr Asp Tyr 
35 40 45 

Ala Phe Glu Met Ala Val Ser Asn He Arg Tyr Gly Ala Ala Val 
50 55 * 60 

Thr Lys Glu Val Gly Met Asp Leu Lys Asn Met Gly Ala Lys Asn 
65 70 75 

Val Cys Leu Met Thr Asp Lys Asn Leu Ser Lys Leu Pro Pro Val 
80 85 90 

Gin Val Ala Met Asp Ser Leu Val Lys Asn Gly He Pro Phe Thr 
95 100 105 

Val Tyr Asp Asn Val Arg Val Glu Pro Thr Asp Ser Ser Phe Met 

HO- 115 120 

Glu Ala He Glu Phe Ala Gin Lys Gly Ala Phe Asp Ala Tyr Val 

125 130 * 135 

Ala Val Gly Gly Gly Ser Thr Met Asp Thr Cys Lys Ala Ala Asn 

140 145 150 

Leu Tyr Ala Ser Ser Pro His Ser Asp Phe Leu Asp Tyr Val Ser 

155 160 165 

Ala Pro He Gly Lys Gly Lys Pro Val Ser Val Pro Leu Lys Pro 

170 175 180 

Leu He Ala Val Pro Thr Thr Ser Gly Thr Gly Ser Glu Thr Thr 

185 190 195 

Gly Val Ala He Phe Asp Tyr Glu His Leu Lys Val Lys He Gly 

200 205 210 

He Thr Ser Arg Ala He Lys Pro Thr Leu Gly Leu He Asp Pro 

215 220 225 

Leu His Thr Leu His Met Pro Ala Arg Val Val Ala Asn Ser Gly 

230 235 240 

Phe Asp Val Leu Cys His Ala Leu Glu Ser Tyr Thr Thr Leu Pro 

245 250 255 

Tyr His Leu Arg Ser Pro Cys Pro Ser Asn Pro He Thr Arg Pro 

260 265 270 

Ala Tyr Gin Gly Ser Asn Pro He Ser Asp He Trp Ala He His 

275 280 285 

Ala Leu Arg He Val Ala Lys Tyr Leu Lys Arg Ala Val Arg Asn 

290 295 300 

Pro Asp Asp Leu Glu Ala Arg Ser His Met His Leu Ala Ser Ala 

305 310 315 

Phe Ala Gly He Gly Phe Gly Asn Ala Gly Val His Leu Cys His 

320 325 330 

Gly Met Ser Tyr Pro He Ser Gly Leu Val Lys Met Tyr Lys Ala 

335 340 " 345 

Lys Asp Tyr Asn Val Asp His Pro Leu Val Pro His Gly Leu Ser 

350 355 360 

Val Val Leu Thr Ser Pro Ala Val Phe Thr Phe Thr Ala Gin Met 

365 370 375 

Phe Pro Glu Arg His Leu Glu Met Ala Glu He Leu Gly Ala Asp 
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380 385 390 

Thr Arg Thr Ala Arg He Gin Asp Ala Gly Leu Val Leu Ala Asp 

395 400 405 

Thr Leu Arg Lys Phe Leu Phe Asp Leu Asp Val Asp Asp Gly Leu 

410 415 * 420 

Ala Ala Val Gly Tyr Ser Lys Ala Asp He Pro Ala Leu Val Lys 

425 430 435 

Gly Thr Leu Pro Gin Glu Arg Val Thr Lys Leu Ala Pro Arg Pro 

440 445 450 

Gin Ser Glu Glu Asp Leu Ala Ala Leu Phe Glu Ala Ser Met Lys 

455 460 465 

Leu Tyr 



<210> 20 

<211> 317 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4293484CD1 

<400> 20 

Met Ala Asp Ser Ala Gin Ala Gin Lys Leu Val Tyr Leu Val Thr 
1 5 10 15 

Gly Gly Cys Gly Phe Leu Gly Glu His Val Val Arg Met Leu Leu 
20 25 30 

Gin Arg Glu Pro Arg Leu Gly Glu Leu Arg Val Phe Asp Gin His 
35 40 45 

Leu Gly Pro Trp Leu Glu Glu Leu Lys Thr Gly Thr Arg Asn Val 
50 55 60 

He Glu Ala Cys Val Gin Thr Gly Thr Arg Phe Leu Val Tyr Thr 
65 70 75 

Ser Ser Met Glu Val Val Gly Pro Asn Thr Lys Gly His Pro Phe 
80 85 90 

Tyr Arg Gly Asn Glu Asp Thr Pro Tyr Glu Ala Val His Arg His 
*5 100 105 

Pro Tyr Pro Cys Ser Lys Ala Leu Ala Glu Trp Leu Val Leu Glu 
110 115 120 

Ala Asn Gly Arg Lys Val Arg Gly Gly Leu Pro Leu Val Thr Cys 
125 130 135 

Ala Leu Arg Pro Thr Gly He Tyr Gly Glu Gly His Gin He Met 
140 145 150 

Arg Asp Phe Tyr Arg Gin Gly Leu Arg Leu Gly Gly Trp Leu Phe 
i55 160 ~ 165 

Arg Ala He Pro Ala Ser Val Glu His Gly Arg Val Tyr Val Gly 
170 175 180 

Asn Val Ala Trp Met His Val Leu Ala Ala Arg Glu Leu Glu Gin 
185 190 195 

Arg Ala Thr Leu Met Gly Gly Gin Val Tyr Phe Cys Tyr Asp Gly 
200 205 210 

Ser Pro Tyr Arg Ser Tyr Glu Asp Phe Asn Met Glu Phe Leu Gly 
215 220 225 

Pro Cys Gly Leu Arg Leu Val Gly Ala Arg Pro Leu Leu Pro Tyr 
230 235 240 

Trp Leu Leu Val Phe Leu Ala Ala Leu Asn Ala Leu Leu Gin Trp 
245 250 255 

Leu Leu Arg. Pro Leu Val Leu Tyr Ala Pro Leu Leu Asn Pro Tyr 
260 265 270 

Thr Leu Ala Val Ala Asn Thr Thr Phe Thr Val Ser Thr Asp Lys 
275 280 285 

Ala Gin Arg His Phe Gly Tyr Glu Pro Leu Phe Ser Trp Glu Asp 
290 295 300 

Ser Arg Thr Arg Thr He Leu Trp Val Gin Ala Ala Thr Gly Ser 
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305 310 315 

Ala Gin 



<210> 21 

<211> 181 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 4440080CD1 

<400> 21 

Met Phe Ser Ala lie Arg Ser Gin His Ser Gly Val Asp lie Cys 
1 5 10 " 15 

lie Asn Asn Ala Gly Leu Ala Arg Pro Asp Thr Leu Leu Ser Gly 
20 25 30 

Ser Thr Ser Gly Trp Lys Asp Met Phe Asn Val Asn Val Leu Ala 
35 40 45 

Leu Ser He Cys Thr Arg Glu Ala Tyr Gin Ser Met Lys Glu Arg 
50 55 60 

Asn Val Asp Asp Gly His He lie Asn He Asn Ser Met Ser Gly 
65 70 75 

His Arg Val Leu Pro Leu Ser Val Thr His Phe Tyr Ser Ala Thr 
80 85 "* 90 

Lys Tyr Ala Val Thr Ala Leu Thr Glu Gly Leu Arg Gin Glu Leu 
95 100 " 105 

Arg Glu Ala Gin Thr His He Arg Ala Thr Cys He Ser Pro Gly 
110 115 120 

Val Val Glu Thr Gin Phe Ala Phe Lys Leu His Asp Lys Asp Pro 
125 130 135 

Glu Lys Ala Ala Ala Thr Tyr Glu Gin Met Lys Cys Leu Lys Pro 
140 145 150 

Glu Asp Val Ala Glu Ala Val He Tyr Val Leu Ser Thr Pro Ala 
155 160 165 

His He Gin He Gly Asp He Gin Met Arg Pro Thr Glu Gin Val 
170 175 180 

Thr 



<210> 22 
<211> 360 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5495687CD1 

<400> 22 

Met Val Pro Ala Ala Gly Arg Arg Pro Pro Arg Val Met Arg Leu 
1 5 10 15 

Leu Gly Trp Trp Gin Val Leu Leu Trp Val Leu Gly Leu Pro Val 

20 25 30 

Arg Gly Val Glu Val Ala Glu Glu Ser Gly Arg Leu Trp Ser Glu 

35 40 45 

Glu Gin Pro Ala His Pro Leu Gin Val Gly Ala Val Tyr Leu Gly 

50 55 " 60 

Glu Glu Glu Leu Leu His Asp Pro Met Gly Gin Asp Arg Ala Ala 

65 70 75 

Glu Glu Ala Asn Ala Val Leu Gly Leu Asp Thr Gin Gly Asp His 

80 85 90 
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Met Val Met Leu Ser Val lie Pro Gly Glu Ala Glu Asp Lys Val 
95 100 105 

Ser Ser Glu Pro Ser Gly Val Thr Cys Gly Ala Gly Gly Ala Glu 

110 115 120 

Asp Ser Arg Cys Asn Val Arg Glu Ser Leu Phe Ser Leu Asp Gly 

125 130 135 

Ala Gly Ala His Phe Pro Asp Arg Glu Glu Glu Tyr Tyr Thr Glu 
. 140 145 " 150 

Pro Glu Val Ala Glu Ser Asp Ala Ala Pro Thr Glu Asp Ser Asn 

155 160 165 

Asn Thr Glu Ser Leu Lys Ser Pro Lys Val Asn Cys Glu Glu Arg 

170 175 "* 180 

Asn He Thr Gly Leu Glu Asn Phe Thr Leu Lys He Leu Asn Met 

185 190 195 

Ser Gin Asp Leu Met Asp Phe Leu Asn Pro Asn Gly Ser Asp Cys 

200 205 210 

Thr Leu Val Leu Phe Tyr Thr Pro Trp Cys Arg Phe Ser Ala Ser 

215 220 225 

Leu Ala Pro His Phe Asn Ser Leu Pro Arg Ala Phe Pro Ala Leu 

230 235 240 

His Phe Leu Ala Leu Asp Ala Ser Gin His Ser Ser Leu Ser Thr 

245 250 255 

Arg Phe Gly Thr Val Ala Val Pro Asn He Leu Leu Phe Gin Gly 

260 265 270 

Ala Lys Pro Met Ala Arg Phe Asn His Thr Asp Arg Thr Leu Glu 

275 280 285 

Thr Leu Lys He Phe He Phe Asn Gin Thr Gly He Glu Ala Lys 

290 295 300 

Lys Asn Val Val Val Thr Gin Ala Asp Gin He Gly Pro Leu Pro 

305 310 315 

Ser Thr Leu He Lys Ser Val Asp Trp Leu Leu Val Phe Ser Leu 

320 325 330 

Phe Phe Leu He Ser Phe He Met Tyr Ala Thr He Arg Thr Glu 

335 340 " 345 

Ser He Arg Trp Leu He Pro Gly Gin Glu Gin Glu His Val Glu 

350 355 360 



<210> 23 
<211> 476 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5527735CD1 

<400> 23 

Met Ala Ser Pro Phe Ser Gly Ala Leu Gin Leu Thr Asp Leu Asp 
1 5 10 15 

Asp Phe He Gly Pro Ser Gin Glu Cys He Lys Pro Val Lys Val 

20 25 30 

Glu Lys Arg Ala Gly Ser Gly Val Ala Lys He Arg He Glu Asp 

35 40 45 

Asp Gly Ser Tyr Phe Gin He Asn Gin Asp Gly Gly Thr Arg Arg 

50 55 60 

Leu Glu Lys Ala Lys Val Ser Leu Asn Asp Cys Leu Ala Cys Ser 

65 70 ~ 75 

Gly Cys He Thr Ser Ala Glu Thr Val Leu He Thr Gin Gin Ser 

80 85 90 

His Glu Glu Leu Lys Lys Val Leu Asp Ala Asn Lys Met Ala Ala 

95 100 105 

Pro Ser Gin Gin Arg Leu Val Val Val Ser Val Ser Pro Gin Ser 
110 115 120 

Arg Ala Ser Leu Ala Ala Arg Phe Gin Leu Asn Pro Thr Asp Thr 
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<210> 24 

<211> 621 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5540437CD1 

<400> 24 

Met Ser Gly Cys Gly Leu Phe Leu Arg Thr Thr Ala Ala Ala Arg 
1 5 10 15 

Ala Cys Arg Gly Leu Val Val Ser Thr Ala Asn Arg Arg Leu Leu 

20 25 30 

Arg Thr Ser Pro Pro Val Arg Ala Phe Ala Lys Glu Leu Phe Leu 

35 40 45 
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Gly Lys lie Lys Lys Lys Glu Val Phe Pro Phe Pro Glu Val Ser 
50 55 60 

Gin Asp Glu Leu Asn Glu lie Asn Gin Phe Leu Gly Pro Val Glu 
65 70 75 

Lys Phe Phe Thr Glu Glu Val Asp Ser Arg Lys He Asp Gin Glu 
80 85 " 90 

Gly Lys He Pro Asp Glu Thr Leu Glu Lys Leu Lys Ser Leu Gly 
95 100 105 

Leu Phe Gly Leu Gin Val Pro Glu Glu Tyr Gly Gly Leu Gly Phe 
HO 115 120 

Ser Asn Thr Met Tyr Ser Arg Leu Gly Glu He He Ser Met Asp 
125 130 135 

Gly Ser He Thr Val Thr Leu Ala Ala His Gin Ala He Gly Leu 
140 145 150 

Lys Gly He He Leu Ala Gly Thr Glu Glu Gin Lys Ala Lys Tyr 
155 160 165 

Leu Pro Lys Leu Ala Ser Gly Glu His He Ala Ala Phe Cys Leu 
170 175 180 

Thr Glu Pro Ala Ser Gly Ser Asp Ala Ala Ser He Arg Ser Arg 
185 190 195 

Ala Thr Leu Ser Glu Asp Lys Lys His Tyr He Leu Asn Gly Ser 
200 205 210 

Lys Val Trp He Thr Asn Gly Gly Leu Ala Asn He Phe Thr Val 
215 220 225 

Phe Ala Lys Thr Glu Val Val Asp Ser Asp Gly Ser Val Lys Asp 
230 235 240 

Lys He Thr Ala Phe He Val Glu Arg Asp Phe Gly Gly Val Thr 
245 250 255 

Asn Gly Lys Pro Glu Asp Lys Leu Gly He Arg Gly Ser Asn Thr 
260 265 270 

Cys Glu Val His Phe Glu Asn Thr Lys He Pro Val Glu Asn He 
275 280 285 

Leu Gly Glu Val Gly Asp Gly Phe Lys Val Ala Met Asn He Leu 
290 295 300 

Asn Ser Gly Arg Phe Ser Met Gly Ser Val Val Ala Gly Leu Leu 
305 310 "* 315 

Lys Arg Leu He Glu Met Thr Ala Glu Tyr Ala Cys Thr Arg Lys 
320 325 330 

Gin Phe Asn Lys Arg Leu Ser Glu Phe Gly Leu He Gin Glu Lys 
335 340 345 

Phe Ala Leu Met Ala Gin Lys Ala Tyr Val Met Glu Ser Met Thr 
350 355 360 

Tyr Leu Thr Ala Gly Met Leu Asp Gin Pro Gly Phe Pro Asp Cys 
365 370 375 

Ser He Glu Ala Ala Met Val Lys Val Phe Ser Ser Glu Ala Ala 
380 385 390 

Trp Gin Cys Val Ser Glu Ala Leu Gin He Leu Gly Gly Leu Gly 
395 400 405 

Tyr Thr Arg Asp Tyr Pro Tyr Glu Arg He Leu Arg Asp Thr Arg 
410 415 420 

He Leu Leu He Phe Glu Gly Thr Asn Glu He Leu Arg Met Tyr 
425 430 435 

He Ala Leu Thr Gly Leu Gin His Ala Gly Arg He Leu Thr Thr 
440 445 450 

Arg He His Glu Leu Lys Gin Ala Lys Val Ser Thr Val Met Asp 
455 460 465 

Thr Val Gly Arg Arg Leu Arg Asp Ser Leu Gly Arg Thr Val Asp 
470 475 480 

Leu Gly Leu Thr Gly Asn His Gly Val Val His Pro Ser Leu Ala 
485 490 495 

Asp Ser Ala Asn Lys Phe Glu Glu Asn Thr Tyr Cys Phe Gly Arg 
500 505 510 

Thr Val Glu Thr Leu Leu Leu Arg Phe Gly Lys Thr He Met Glu 
515 520 525 

Glu Gin Leu Val Leu Lys Arg Val Ala Asn He Leu He Asn Leu 
530 535 540 

Tyr Gly Met Thr Ala Val Leu Ser Arg Ala Ser Arg Ser He Arg 
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545 550 555 

lie Gly Leu Arg Asn His Asp His Glu Val Leu Leu Ala Asn Thr 

560 565 570 

Phe Cys Val Glu Ala Tyr Leu Gin Asn Leu Phe Ser Leu Ser Gin 

575 580 585 

Leu Asp Lys Tyr Ala Pro Glu Asn Leu Asp Glu Gin He Lys Lys 

590 595 600 

Val Ser Gin Gin He Leu Glu Lys Arg Ala Tyr He Cys Ala His 

605 610 ~ 615 
Pro Leu Asp Arg Thr Cys 

620 



<210> 25 

<211> 245 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5596281CD1 
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<210> 26 

<211> 159 

<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 5731013 

<400> 26 

Met Thr Ala Arg Gly Thr Pro Ser Arg Phe Leu Ala Ser Val Leu 

.1 5 10 15 

His. Asn Gly Leu Gly Arg Tyr Val Gin Gin Leu Gin Arg Leu Ser 

20 25 " 30 

Phe Ser Val Ser Arg Asp Gly Ala Ser Ser Arg Gly Ala Arg Glu 

35 40 45 

Phe Val Glu Arg Glu Val lie Asp Phe Ala Arg Arg Asn Pro Gly 

50 55 " 60 

Val Val lie Tyr Val Asn Ser Arg Pro Cys Cys Val Pro Arg Val 

65 70 75 

Val Ala Glu Tyr Leu Asn Gly Ala Val Arg Glu Glu Ser He His 

80 85 90 

Cys Lys Ser Val Glu Glu He Ser Thr Leu Val Gin Lys Leu Ala 

95 100 " 105 

Asp Gin Ser Gly Leu Asp Val He Arg He Arg Lys Pro Phe His 

HO 115 120 

Thr Asp Asn Pro Ser He Gin Gly Gin Trp His Pro Phe Thr Asn 

125 130 135 

Lys Pro Thr. Thr Phe Arg Gly Leu Arg Pro Arg Glu Val Gin Asp 

140 145 150 
Pro Ala Pro Ala Gin Val Gin Ala Gin 
155 



<210> 27 

<211> 291 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5731162CD1 

<400> 27 

Met Ala Cys Leu Ser Pro Ser Gin Leu Gin Lys Phe Gin Gin Asp 

1 5 10 15 

Gly Phe Leu Val Leu Glu Gly Phe Leu Ser Ala Glu Glu Cys Val 

20 25 30 

Ala Met Gin Gin Arg He Gly Glu He Val Ala Glu Met Asp Val 

35 40 45 

Pro Leu His Cys Arg Thr Glu Phe Ser Thr Gin Glu Glu Glu Gin 

50 55 60 

Leu Arg Ala Gin Gly Ser Thr Asp Tyr Phe Leu Ser Ser Gly Asp 

65 70 75 

Lys He Arg Phe Phe Phe Glu Lys Gly Val Phe Asp Glu Lys Gly 

80 85 ~ 90 

Asn Phe Leu Val Pro Pro Glu Lys Ser He Asn Lys He Gly His 

95 100 105 

Ala Leu His Ala His Asp Pro Val Phe Lys Ser He Thr His Ser 

110 115 120 

Phe Lys Val Gin Thr Leu Ala Arg Ser Leu Gly Leu Gin Met Pro 

125 130 135 

Val Val Val Gin Ser Met Tyr He Phe Lys Gin Pro His Phe Gly 

140 145 150 

Gly Glu Val Ser Pro His Gin Asp Ala Ser Phe Leu Tyr Thr Glu 

155 160 " 165* 

Pro Leu Gly Arg Val Leu Gly Val Trp He Ala Val Glu Asp Ala 

170 175 180 

Thr Leu Glu Asn Gly Cys Leu Trp Phe He Pro Gly Ser His Thr 

185 190 195 
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Ser Gly Val Ser Arg Arg Met Val Arg Ala Pro Val Gly Ser Ala 

200 205 210 

Pro Gly Thr Ser Phe Leu Gly Ser Glu Pro Ala Arg Asp Asn Ser 

215 220 225 

Leu Phe Val Pro Thr Pro Val Gin Arg Gly Ala Leu Val Leu lie 

230 235 240 

His Gly Glu Val Val His Lys Ser Lys Gin Asn Leu Ser Asp Arg 

245 250 255 

Ser Arg Gin Ala Tyr Thr Phe His Leu Met Glu Ala Ser Gly Thr 

260 265 270 

Thr Trp Ser Pro Glu Asn Trp Leu Gin Pro Thr Ala Glu Leu Pro 

275 280 285 

Phe Pro Gin Leu Tyr Thr 

290 



<210> 28 

<211> 1557 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 543496CB1 

<400> 28 

ctgagtgcga cggcgcaggt ctgcaccatg 
gtgcgtgcag ctccccacag cggcccgctg 
acagacaccg tgtatgacgt ggtggtgtcg 
tgtgccttgg gatatgatat tcactttcat 
ccaaagaaag tactggagaa attgtcagaa 
cctggctctg caacgcttct cagtagtttt 
tacagagcct ttcggcgaat gcaggtgtgg 
gataaggata atttagatga catgggctat 
ctcactaagc agttggaggc tgtgtctgac 
attcgctata cctggccttg tccatttcct 
accctaggtg atggcagcac cttccagacc 
tccggagtac ggcaggctgt tggaatccag 
gttgtggcta ctctgcattt atcagaggcc 
cttccctctg ggcctattgc tctgctcccg 
tccacgtccc atgaacatgc agcagagcta 
gccgttaact ctgccttttg gagtgatgct 
gccatgctgc agtatgctgt cagccttctg 
cccccaagcg tagccagggt ggatgccaaa 
catgctgctg agtacgtcag gcctcgggtg 
catccgcttg caggacaggg tgtcaacatg 
cacctcagta cggcagcctt caatgggaag 
tatgaaacag aaagacagcg tcacaacact 
aggctctatt ctaccagtgc ctccccgctt 
acaaatgcag tgtctccact caaagaacag 
ctctcctaaa gaaagattac gttgatgaaa 
tattttcaag atcttattta atttaataaa 



gcggcccggc ttgtcagccg atgcggggct 60 
gtgtcctggc gcaggtggtc cggcgcctca 120 
ggtggaggcc tggtgggcgc tgccatggcc 180 
gacaagaaaa tcctgttgct cgaagcaggt 240 
acttacagca acagggtcag ctccatttcc 300 
ggtgcctggg accatatctg caacatgaga 3 60 
gacgcctgct cagaggccct gataatgttt 420 
atcgtggaga atgatgtcat catgcatgct 480 
cgagtgacgg ttctctacag gagcaaagcc 540 
atggccgact ccagcccttg ggttcatatt 600 
aaattgttga taggtgcaga tggtcacaac 660 
aatgtgagct ggaactatga ccagtctgct 720 
acagaaaaca acgtagcctg gcagagattt 780 
ctctcagaca ccttgagttc cttggtttgg 840 
gttagcatgg atgaggaaaa atttgtggat 900 
gaccacacgg acttcatcga cacagctggt 960 
aagcccacta aggtctcggc tcgccagctg 1020 
agccgagttc tgtttcctct tgggttggga 1080 
gcgctcattg gggatgcagc ccacagagtc 1140 
ggctttgggg atatctccag cttggcccat 1200 
gacttaggtt ccgtgagcca cctcacaggt 1260 
gctcttctgg ctgctacaga cttactaaaa 1320 
gtgttgctca ggacgtgggg cttgcaggcc 1380 
attatggcct ttgcaagcaa atgagtactc 1440 
aagaacatcc tgcccaggac ccatcataca 1500 
cttactttac attaaaaaaa aaaaaaa 1557 



<210> 29 
<211> 1106 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 907607CB1 

<400> 29 

cccacgcgtc cgggttcccc ggcctctctt ggtcagggtg acgcagtagc ctgcaaacct 60 
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cggcgcgtag 
cttcccagtg 
gtcagggaga 
gagctgcggc 
acaccattaa 
cagcctttta 
acagcttaaa 
ccccagctgc 
aaagtgtctt 
ggaatttact 
ccctggttac 
tggcaatgat 
gggatgtaat 
gtaccagctg 
tttatattcc 
aagcaaactc 
taatatatat 
ctcttcagag 



gccaccgcac 
cccgcccagc 
cccttaggca 
tgttgccgcc 
gaaacagcct 
tcacccagtg 
gtttatacca 
agcatttcgc 
ctttggacca 
gaaaccagat 
tgaggaaaca 
taaggaattg 
ctacaaaggc 
ccctagttca 
ggaggtagaa 
accttaaaat 
caagttttta 
aatgatatat 



ttatccgcag 
taccggccag 
ctccggacta 
gggctgcgca 
ctgcatcagt 
agatacatgt 
ggaaaaccag 
tcccctctgg 
gatttcatca 
atttatgcaa 
ccttcaggag 
ttagatacta 
tttgaagatg 
atcattactc 
ggcgtagaac 
aatctggatt 
ttattaatat 
aaaaaa 



caggaccgcc 
cctgcggctg 
agatggcggc 
ggcggttctg 
ttgtacaaag 
ttattcaaac 
ttcttgagac 
ctaggcagtt 
ctgtcacaaa 
caatcatgga 
aagcaggatc 
gaatacggcc 
gcattgtaca 
tgaaaaatgg 
aggttatgga 
ttctttgggc 
gctgaggaac 



cgcagccggt 
cgcagatctt 
gacggccagg 
tcatatgttg 
accacttttc 
acaagatacc 
aaggaccatg 
atttaggatt 
ggaaaatgaa 
cttctttgca 
tgaagaagat 
aactgtgcag 
gctgaaactc 
aattcagaac 
tgatgaatca 
ataacagtca 
ttgaagatta 



agggtgggct 
tcgtggttct 
cggggctggg 
aagaatccat 
ccactacctg 
ccaaatccaa 
gattttccca 
gaaggagtaa 
gaattagact 
tctggcttac 
gatgaagttg 
gaagatggag 
cagggttctt 
atgctgcagt 
gatgaaaaag 
gacttgttga 
ataaaatatg 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1106 



<210> 30 

<211> 2180 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1290078CB1 

<400> 30 

gccccacaca ggccgcggcg gctggctcgg 
ggaagccagg cggctggcgg aggaggagag 
gccgcagact tacttccccg gctcagcagg 
tatgcaaagt tgtgaatcca gtggtgacag 
gagaagggga cagcctcgtg tggtggtgat 
caaagcactt cttgagcagg gtttcacgga 
cggaggccgt gtgcagagtg tgaaacttgg 
gatccatggc tcccatggga accctatcta 
agagacaacc gatggggaac gcagcgtggg 
ggcctgctac cttaccaacc acggccgcag 
cgatttatac aacgaggtct ataacttgac 
caatgctgaa agtcaaaata gcgtgggggt 
caggaatgac cctgacgacc cagaggctac 
gtacctgaag gtggagagct gtgagagcag 
cgccttcggg gagtggaccg agatccccgg 
gcgggttgtg gagctgctgg cggagggcat 
tgtccgctgc attcactggg accaggcctc 
ccggggtgag ggcgaccaca atcacgacac 
ccgggggggc aggtgggatg aggatgagca 
tgagctgatc ccggcggacc atgtgattgt 
gtacaccagt ttcttccggc caggcctgcc 
gggcattggc accaccgaca agate tttct 
gtgeaacage ctacagtttg tgtgggagga 
acctgagctc tggtaccgea agate tgegg 
cggccatgtg ctgagegget ggatctgegg 
tgacgaggca gtggccgaga tetgeaegga 
cattccaaaa cctcggcgaa tcttgcgctc 
ctcctattca tacaegcagg tgggctccag 
cctgccgtac aeggagaget caaagacagc 
cacccaccgc aagtactatt ccaccaccca 
tgcccgcctc attgagatgt accgagacct 
getgetgaga agagecacta actcgtgacc 
tgccttcctg atcctctgta gaaaggattt 
gccttcagac ctggccctgt agcttttctt 
ccgttgagtt acctctgtgc tggatcccgt 
tgttattgta agtgccttca atactttgea 
tgcccctcaa aaaaaaaaaa 



gcccctacgg tcccggcggc ggctggagga 60 
aeggaggagg ccgagaccgg agcgccgctc 120 
gaaaggttcc tagaaggtga gegeggaegg 180 
tgeggatgae cctctcagtc gcggcctacg 240 
cggcgccggc ttggctggcc tggctgeage 3 00 
tgtcactgtg cttgaggctt ccagccacat 360 
acacgccacc tttgagctgg gagccacctg 420 
tcatctagca gaagccaacg gcctcctgga 480 
ccgcatcagc ctctattcca agaatggcgt 540 
gatccccaag gacgtggttg aggaattcag 600 
ccaggagttc ttccggcacg ataaaccagt 660 
gttcacccga gaggaggtgc gtaacegcat 720 
caagcgcctg aagctcgcca tgatccagca 780 
ctcacacagc atggacgagg tgtccctgag 840 
cgctcaccac atcatcccct egggcttcat 900 
ccctgcccac gtcatccagc tagggaaacc 960 
agcccgcccc agaggecctg agattgagee 1020 
tggggagggt ggccagggtg gagaggagee 1080 
gtggtcggtg gtggtggagt gcgaggactg 1140 
gaccgtgtcg ctaggtgtgc taaagaggca 1200 
cacagagaag gtggctgcca tccaccgcct 1260 
ggaattcgag gagcccttct ggggccctga 1320 
egaageggag agccacaccc tcacctaccc 1380 
ctttgatgtc ctctacccgc ctgagegcta 1440 
ggaggaggee ctegtcatgg agaagtgtga 1500 
gatgetgegt cagttcacag ggaaccccaa 1560 
ggcctggggc agcaaccctt acttccgcgg 1620 
eggggeggat gtggagaagc tggccaagcc 1680 
gcccatgcag gtgctgtttt ccggtgaggc 1740 
cggtgctctg ctgtccggcc agegtgagge 1800 
cttccagcag gggacctgag ggctgtcctc 1860 
tccagcctgc cccttgctgc cgtgtgctcc 1920 
ttatcttctg tagagctagc cgccctgact 1980 
tttctccagg ctgggccgtg agcaggtggg 2040 
gcccccactt gcctaccctc tgtcctgcct 2100 
ttttgggata ataaaaaagg ctccctcccc 2160 

2180 
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<210> 31 

<211> 1311 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1302741CB1 

<400> 31 

ccgccgccca cccgagccta tctgggctgc 
cgccccaatc cttgcgtgtg cttgcagtcc 
ccagtctccg acttccattt cccaccctaa 
gttgtcctcg ccctgctgcg ctgagtgtcc 
acgctgcaga gctcgtgggt gaccttccgc 
agtctggctt tcgtctacgg ctccggggtg 
aagaatgcta tgctggactt tgtgttcaca 
aacctgaaga aaaattggag tea c tact ct 
acgtccatcc agaataacta tggcgctgga 
ggtaggctta tcaaatatgg agttattagc 
tggaataact tatacattgc tggacgactc 
gaggatgtca ctcttagatc agccctcgat 
ttcctcatgc tccccgaaag cttttctgaa 
tcctattcag gtgactttcg gatggtggtt 
gtgaagecca atatagecca ctttcgagag 
caagtggtgt ataaaageca gcaaggctgg 
ttcactcagc tgatgacatt gcccaaaacc 
cctcctggaa aaaacagaga tgtggaagaa 
tgtggagatg tggtgcgact agggctttca 
agcacgaaag gcatttttac tgctggcctg 
ctgcacaaaa tgtggaaagg gtggctgagg 
atgttatatg tagatgaata aagtgtttga 



gtcttctcgc cgctgctctt cgtggcccaa 60 
caccccacac teagecttgt gtccctcgat 120 
accgcctacc cggtgtctgt tccccgcccg 180 
cctgttagcc tcgaccccat ggcgctgcag 240 
aagatcctgt ctcacttccc cgaggagctg 300 
taccgccagg cagggecgag ttcagaccag 3 60 
gtagatgacc ctgtcgcatg gcattcaaag 420 
ttcctaaaag ttttagggee caagattatc 480 
gtttactaca attcattgat catgtgtaat 540 
actaaegtte tgattgaaga tctcctcaac 600 
caaaaacegg tgaaaattat ctcagtgaac 660 
agaaatctga agagtgctgt gaccgctgct 720 
gaagacctct tcatagagat tgccggtctc 780 
ggagaagata aaacaaaagt gttgaatatt 840 
ctctatggca gcatactaca ggaaaatcct 900 
ctggagatag ataaaagece agaaggacag 960 
ttacagcaac agataaatca tattatggac 1020 
actttattcc aagtggctca tgatcccgac 1080 
gcaatcgtga gaeegtctag tataagacag 1140 
aagaagtcag tgatttatag ttcactaaaa 1200 
aaaacatcct gattttgett gcttttatat 1260 
tcctttttga caaaaaaaaa a 1311 



<210> 32 

<211> 921 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1541028CB1 

<400> 32 

ggcccctgcg ccccacgtgg cgccctttcc 
gacaactctc ccccctcccc cggcacccgg 
gggtggctga gctgagaggg etcegggaag 
tttgtgccgc gggacgccca gcggcacgac 
attttgeggg acttegggge tgtgagctgg 
ggctggtccc cacctctggc ccccgggtgc 
tgccaaccca gcaactgccg ccggtgcccg 
agctttatga gtgccaacac etteggcaat 
eggctcgagg gccgctgtgg gcccaccaat 
atccctcctg gctgtgagct ggtggtcggc 
tgtctactgg tggacgactc ttttctacac 
gggectcgag tggtcttcat cgtggacctc 
caggccctcg actttgtctt cgccccagac 
caggctggag agacactgeg ctcagggacg 
tgcgggggtg ggcgggggcg gaggatggga 
tgaatcctca aaaaaaaaaa a 



gggcgggggg ggtggggagg cacctccagc 60 
ctgatttget gtgccactgg gagggttegg 120 
gagtgacgtc aggacctgcc ttcagccccc 180 
gtggagctcc tggagagcag cttccctgcc 240 
gacttctcag ggactacccc tccgcctcgg 300 
taccagctcc tgctgtacca ageaggcegg 360 
ggggectate gggcactgag ggggcttcga 420 
geeggctttt ccgttctcct gcctggggcc 480 
gcccgggtca gatgecatet gggcctaaag 540 
ggtgagcccc agtgctgggc tgaggggcac 600 
acagtggctc acaatggctc ccccgaagat 660 
tggcacccca acgtggcagg ggctgagege 720 
ccttgaagga aggtgctccc ttcacacacc 780 
gcttgatggt agccaggacc tcctctctac 840 
actggctagt gagcactgaa atataaattc 900 

921 



<210> 33 

<211> 2032 

<212> DNA 

<213> Homo sapiens 



27/40 



WO 01/44448 



PCT/US00/33158 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1597687CB1 

<400> 33 

agaaaccagg cggcgcgttc ccggtggcgg cgccctggac tcccgggccc gcgcatcccc 60 
gccagccttc cttaaggcgg atgggtggcc cccgagaccc cgtcggaccc atggtttcca 120 
gtgcagcgcg gagtgggcga tgccagcgtg ccaggagcca tgtctgacca ggacgtttgg 180 
aagatcatat ccatgccaga ggctcttgtg aggagatgag ttggtaaaga gagaggctgg 240 
gatgaagatg ctgccaggag tgggcgtgtt tgggactggc agctccgccc gagttctggt 3 00 
cccactgctg agggcagaag ggttcactgt tgaggccctg tgggggaaga ctgaggagga 360 
ggcgaagcag cttgctgagg agatgaacat cgccttctac accagccgga ctgatgacat 420 
cttgctgcat caagatgtgg atctggtgtg catcagcatc ccccctccac tcacccggca 480 
gatatccgtg aaggctctag gtattgggaa gaatgtggtt tgcgagaagg cagcaacatc 540 
ggtggatgcc ttccggatgg tgacagcctc gcgctactac ccgcagctca tgagcctggt 600 
agggaacgtg ctgcgcttcc tgcctgcctt cgtgcgcatg aaacagctga tttcggaaca 660 
ctatgtggga gcggtgatga tctgtgatgc ccgcatctac tcaggcagcc tgctgagccc 720 
cagctatggc tggatctgtg atgagctcat gggcggcggg ggcttgcaca ccatggggac 780 
ctacattgtg gacctgctga cccacctgac cggccggaga gccgagaagg tgcacgggct 840 
gctcaagaca ttcgtgaggc agaacgctgc catccgtggc atccggcacg tcactagcga 900 
tgacttctgt ttcttccaga tgctcatggg tgggggtgtg tgtagcacag tgacactcaa 960 
cttcaacatg ccaggcgcct ttgtgcatga agtcatggtg gtaggctctg caggacgcct 1020 
cgtcgcccgg ggagccgacc tctatgggca gaagaactct gccacgcaag aggagctgct 1080 
cttgagggac icgctggcag tgggcgcagg actgcctgag caggggcccc aggatgtccc 1140 
gctgctgtac ctgaagggca tggtctacat ggtgcaggcc ttgcgccagt ccttccaggg 1200 
gcagggcgac cgccgcacct gggaccgcac ccctgtctcc atggccgcct ccttcgagga 1260 
tgggctgtac atgcagagcg tggtggatgc catcaagagg tcgagccgat ccggggagtg 1320 
ggaggctgtg gaggtgctga cggaggagcc cgacaccaac cagaacctgt gtgaggcact 1380 
tcagcggaac aacctatgag cctgcacctg ggctccttgc cacagggcag agggaccagg 1440 
gaggggaaca ggagccagac atgacaggga cttggcccta gcagagacag tgtctttcat 1500 
ttaatgagtc tgggtgaagc cagaggtggc tctgggactt cgccccctcc agtggctcat 1560 
ctgctcctca gacttgcagg gtggcaaatg ttcctgttgc caggttacca cggtgtgcac 1620 
agggcagagc tgccgccagg cctcctcatg gtgatgccag tgagtcggac caggaaaggc 1680 
tcaccctggg gagcccttca gcctgatccg cagtcctaac agggtgagaa agcacaagag 1740' 
gggccagctc ggctgaggcc tgaggagaaa tgatgagaag ttttgtttcg ttcttgggct 1800 
ggggcatctc aggaacagtg agggcaacaa gctgtccctc agggaccctc acctgccctc 1860 
ccatgtgacc aggcccttcc caggtgggga ctggggcttg ctttattaag tgagctcttg 1920 
ggcttttgaa gtaggacaga gggccctggt ttggcagaga ggaggcaagg gagggcttta 1980 
gaacatctga gaaatgttaa taaataattc agaaaataaa aaaaaaaaaa aa 2032 



<210> 34 

<211> 1134 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1690348CB1 

<400> 34 

cgtggacttc taagaaagcg ccatggcctg tgctgctgtt atgattcctg ggttgttgcg 60 
gtgctctgtt ggagccatcc gtattgaggc tgcgtcactg agattgacac tcagcacttt 120 
gcgccatctt actctaacca gcataatgaa atccaaaagg aaaactgatc acatggagag 180 
aactgcaagt gtccttcgac gggagattgt ggcagcagct aaggtgtgtg gagctgccag 240 
tgagtcaccg tcagtgaaga gcctccgctt gcttgttgct gatcaagact tttcctttaa 300 
agctggccag tgggttgatt tctttattcc aggagtctct gtggttggtg ggttttcaat 3 60 
atgctccagt cccagactgc tagaacaaga gagagtgata gaattggcag tgaaatatac 420 
gaaccaccct cctgccctct gggttcacaa tacgtgtaca cttgactgtg aagtggctgt 480 
gagagtgggt ggagagttct tctttgaccc tcagcctgcg gatgcctcta gaaacctcgt 540 
gttgattgca ggaggagtcg gaattaaccc tctgctttcc atcctgcggc acgcagcaga 600 
tctcctcaga gagcaggcaa acaaaagaaa tggatatgag ataggaacaa taaaactatt 660 
ctacagtgca aaaaatacca gcgaactcct gtttaagaaa aatatccttg atttagtaaa 720 
tgaatttcct gagaagattg catgcagttt gcatgttaca aaacagacta cacaaatcaa 780 
tgcggaactc aagccataca tcacggaagg aagaataacg gagaaggaga taagagatca 840 
tatttcaaaa gagactttgt tctatatttg tggcccacct ccaatgacag actttttctc 900 
caagcaactg gaaaacaacc atgtacccaa agaacacatt tgctttgaga agtggtggta 960 
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ggaggcagac aaaggcagaa aaaataaaga ggtgagatct actcaggaga gctcctgtcc 1020 
tttgtggcat gattaatttt ttttatctct acttgagttg tcttattttt taaggctata 1080 
aacttagtga ccagctggat aataaaagcc agctggcaga cttaaatgat aaac 1134 



<210> 35 
<211> 734 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> unsure 
<222> 661 

<223> a or g or c or t, unknown, or other 
<220> 

<221> misc_feature 

<223> Incyte ID No: 1865603CB1 



<400> 35 

gggaggaccc tgggcaaaga cacctaccct 
ccagtcaaga tgtggggcag gactttggag 
cctcccccat gtcttatccg tagacttgac 
gacaccacca tgttttattc caagatcctg 
cggaaagcac tgtgttttgg agaccagaaa 
gaacccaaag ccgctcaccc agttcctggc 
cctttggagg aaatgatcca gcacctcaag 
gtccccagaa caggggcaaa agggcctatc 
aatctgattg aggtgtccaa ctacatctcc 
gtcccccttg atgtcgccct ctcctttcct 
ctccaaagac tgaggactta ggcacttcac 
nttgggacca aacctacatg tctgtatgtc 
ctcaatgaaa aaaa 



gccatgctgc gccatctgcc ctccaggctg 60 
aaacagtcat ggagggacag cagtcagacc 120 
cacatcgtga tgacggtgaa gagcatcaaa 180 
ggcatggagg tcatgacttt taaggaagac 240 
tttaacctcc acgaggtggg aaaggaattt 300 
tccctggaca tatgtctgat cacagaggtg 360 
gcttgtgatg tccctattga ggaggggcca 420 
atgtccatct acttccgaga ccccgacaga 480 
tcgtgatgga ggctggacct cctccattct 540 
tcctgcaaac ccccacccag gcccagagat 600 
acatcctgct gaggggggac ccaagacagg 660 
atcaaagttg gcctaataaa tgcataacct 720 

734 



<210> 36 

<211> 2221 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1976472CB1 

<400> 36 

gtcgggggct tggcctctgc ccggccacag 
cggcgatccc cgggcagggc ccggggccgg 
tctttcccgg cccccaggtc gcagcagcgt 
taaaacaggg cagaagcctt atggattgga 
cgggattaaa aggcaggtta attgaagtaa 
aagatgattg ttggatatgc ataagaggtt 
atcatcctgg tggagaagat gaactaatga 
ttgatcaggt tcatcgttgg gtcaattatg 
gaatggccat taaacctgct gttctgaaag 
atggcatgct tcccaagagc caagtgacag 
caagctatga ttggttccaa acagactctt 
aggatatcaa tttagactca ataatagttg 
caattattaa ggattgttta tatcttatac 
atttttctgt gcgggttgtt gagagtgtgg 
agaatacttc ttgggacttt cttggccatc 
ggaaagatac aggtttgtac tacagaaagt 
atgatacgag gcttttctgt ttgatgctgc 
ggcaacatgt ttacctcaag ctacctatta 
ctgtatctgg ttccttactc tcagagttca 
tctacttttt gataaaaatc tatcccactg 
agattggaga ttttgtttct gtaagcagtc 
aagaattaga agatctcttt ttgttggcag 



agccggagct ggaggtgctg tcccgtctgg 60 
ggtttgaaga tgctgaacgt cccttcccag 120 
gtcgcctccg gggggcgtac aaggtacctt 180 
ttcgactgac caaaagtgga aaggatctaa 240 
ctgaagaaga acttaagaaa cacaacaaaa 300 
tcgtttataa tgtcagccct tatatggagt 360 
gagcagcagg atcagatggt actgaacttt 420 
aatccatgct gaaagaatgc ctggttggca 480 
actatcgtga ggaggaaaag aaagtcttaa 540 
atacacttgc caaagaaggt cctagttatc 600 
tagtcaccat tgccatatat actaaacaga 660 
atcatcagaa tgattccttt agagcagaaa 720 
atattgggct aagccatgag gttcaggaag 780 
gaaaaataga gattgttcta caaaaaaaag 840 
ccctgaagaa tcataattca cttattccaa 900 
gccagttaat ttccaaggaa gatgttactc 960 
caccaagcac tcatcttcaa gtgcccattg 1020 
caggtacaga aatagtaaag ccatatacac 1080 
aggaaccagt tcttcccaac aataaataca 1140 
gactcttcac accagagctt gatcgtcttc 1200 
ctgagggcaa ttttaaaata tccaagttcc 1260 
ctggaacagg cttcacacca atggttaaaa 1320 
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tactgaatta 
ataaaacaga 
aaagactgga 
gacatatttc 
ttctcgtctg 
atctcaactt 
tcctttattc 
tttgtacata 
gttccttcag 
atatttgacc 
cacaaatttc 
gacatgataa 
caatacagat 
ttattaatgt 
tgtgtatggc 



tgctttgact 
agatgatata 
tgttgaattt 
accagctctt 
catttgtgga 
ttccaaaaat 
aactagttta 
acaaaaggtt 
tacaggtaac 
tggaaagtta 
cttgccattt 
agtaaatgat 
taaattttac 
taaaacatag 
attgatgcag 



gatataccca 
atttggagaa 
gttctctcag 
ctttctgaat 
ccagtgccat 
gagatccata 
tctaaatttg 
aactagaatc 
ttcttggctt 
atcatggcaa 
aaattatatc 
cacttcgatc 
cttgcactgt 
aatgttataa 
aatagaataa 



gtctcaggaa 
gccaattgga 
cacctatttc 
ttttgaaaag 
ttacagaaca 
gttttacagc 
tgattgctta 
cagccttcag 
tcttttgtac 
caaatacata 
actgttctac 
atgtttctat 
taactcagga 
cttattaagt 
aattatactt 



agtgaagctg 
gaaattagca 
tgaatggaat 
aaatttggac 
aggagtaagg 
ataatgaaga 
gggtttttta 
tttcttaaat 
cacaacttat 
caggattctt 
aataagcact 
gctgtaaaat 
aatgatcatt 
gatccaaaca 
aagttctttt 



atgttcttca 
tttaaagata 
ggcaaacagg 
aaatccaaag 
ttgctgcatg 
gctgtcattg 
agagaacatt 
gaaatcaaat 
tttactactg 
tgttatgaat 
tgtgttttat 
gtcttattag 
tatgtccttg 
tttttttgtg 
taaaaaaaaa 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2221 



<210> 37 
<211> 1706 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2050821CB1 

<400> 37 

agaagaggtt gtgtgggaca agctgctccc 
tggctgggcc tcagaccggt ggcaatgtcc 
tcctggctac tcgcccgcat cctggcttgg 
ctccagtgtt tcccacagcc cccaaaacgg 
actcctacag aggagggctt gaaggactcg 
tttacggtat ggctgggtcc catcatcccc 
cggtctatca ccaatgcctc agctgccatt 
ctgaagccct ggctgggaga agggatactg 
cgtcggatgc tgacgcccgc cttccatttc 
aacaagagtg caaacatcat gcttgacaag 
cgtctggaca tgtttgagca catcagcctc 
ttcagctttg acagccattg tcaggagagg 
ctcagtgccc ttgtagagaa aagaagccag 
tacctctccc atgacgggcg gcgcttccac 
gacgctgtca tccgggagcg gcgtcgcacc 
aaagacaaag ccaagtccaa gactttggat 
gaagatggga aggcattgtc agatgaggat 
ggaggccatg acaccacggc cagtggcctc 
ccagaatacc aggagcgctg ccgacaggag 
aaagagattg aatgggacga cctggcccag 
agcctgaggt tacatccccc agctcccttc 
ctcccagatg gccgagtcat ccccaaaggc 
catcacaacc caactgtgtg gccggatcct 
gagaacagca aggggaggtc acctctggct 
tgcatcgggc aggcgttcgc catggcggag 
cacttccggt tcctgccaga ccacactgag 
gccgagggcg ggctttggct gcgggtggag 
gacccatcca cctgtttttt tgcagattgt 
aaaaaaaaac taaaaaaaaa aaaaaa 



gacagaagga tgtcgctgct gagcctgccc 60 
ccatggctac tcctgctgct ggttgtgggc 120 
acctatgcct tctataacaa ctgccgccgg 180 
aactggtttt ggggtcacct gggcctgatc 240 
acccagatgt cggccaccta ttcccagggc 300 
ttcatcgttt tatgccaccc tgacaccatc 3 60 
gcacccaagg ataatctctt catcaggttc 420 
ctgagtggcg gtgacaagtg gagccgccac 480 
aacatcctga agtcctatat aacgatcttc 540 
tggcagcacc tggcctcaga gggcagcagt 600 
atgaccttgg acagtctaca gaaatgcatc 660 
cccagtgaat atattgccac catcttggag 720 
catatcctcc agcacatgga ctttctgtat 780 
agggcctgcc gcctggtgca tgacttcaca 840 
ctccccactc agggtattga tgattttttc 900 
ttcattgatg tgcttctgct gagcaaggat 960 
ataagagcag aggctgacac cttcatgttt 1020 
tcctgggtcc tgtacaacct tgcgaggcac 1080 
gtgcaagagc ttctgaagga ccgcgatcct 1140 
ctgcccttcc tgaccatgtg cgtgaaggag 1200 
atctcccgat gctgcaccca ggacattgtt 1260 
attacctgcc tcatcgatat tataggggtc 1320 
gaggtctacg accccttccg ctttgaccca 1380 
tttattcctt tctccgcagg gcccaggaac 1440 
atgaaagtgg tcctggcgtt gatgctgctg 1500 
ccccgcagga agctggaatt gatcatgcgc 1560 
cccctgaatg taggcttgca gtgactttct 1620 
catgaataaa acggtgctgt caaaaaaaaa 1680 

1706 



<210> 38 

<211> 549 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2408443CB1 
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<400> 38 

cccacgcgtc cgtgtcatgt acttttgtta 
tcaaggaggg caacatttta tggttacatt 
taaagcaaaa caatggctgg ttgaccataa 
agaaccattg cgtgcggaag atgtcaaagc 
agaattaatt tcaacacgct caaaaatttt 
gtcaattaat aaattgattg acctcatcgt 
tattcttgac gatcagcgca tgcaaattgg 
accacgtgaa gttcgtcagc gagagttaat 
agaggcaaaa gatttagtgg ttgaagaagg 
tgacaaaag 



atatttttgc tataataaac ttagtatatt 60 
atactcgtct ccgagttgca cctcttgtcg 120 
tctcccattt attgaacgta atttaaataa 180 
aatgttacga ttgactgaag atgggacgga 240 
ttctgagttg acgattgact tagatgatat 3 00 
catgtatccg tctttactga agcggccaat 3 60 
gtacaatgat gatgaaattc gtcgcttttt 420 
tcgcgcaaca tttaaagctg acttcgcaga 480 
ctgagcacgt cagtcttttt ttctttacca 540 

549 



<210> 39 
<211> 1363 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 

<223> Incyte ID No: 2508668CB1 

<400> 39 

cgtcaggctc tgtgttggtt ggagcgagca tgtgggtctg cagtaccctg tggcgggtgc 60 
gaacccccgc ccggcagtgg cgggggctgc tcccagcttc tggctgtcac ggacctgccg 120 
cctcctccta ctccgcatcc gccgagcctg cccgggtccg ggcgcttgtc tatgggcacc 180 
acggggatcc agccaaggtc gtcgaactca agaacctgga gctagctgct gtgagaggat 240 
cagatgtccg tgtgaagatg ctggcggccc ctatcaatcc atctgacata aatatgatcc 300 
aaggaaacta cggactcctt cctgaactgc ctgctgttgg agggaacgaa ggtgttgcac 3 60 
aggtggtagc ggtgggcagc aatgtgaccg ggctgaagcc aggagactgg gtgattccag 420 
caaatgctgg tttaggaacc tggcggaccg aggctgtgtt cagcgaggaa gcactgatcc 480 
aagttccgag tgacatccct cttcagagcg ctgccaccct gggtgtcaat ccctgcacag 540 
cctacaggat gttgatggac ttcgagcaac tgcagccagg ggattctgtc atccagaatg 600 
catccaacag cggagtgggg caagcggtca tccagatcgc cgcagccctg ggcctaagaa 660 
ccatcaatgt ggtccgagac agacctgata tccagaagct gagtgacaga ctgaagagtc 720 
tgggggctga gcatgtcatc acagaagagg agctaagaag gcccgaaatg aaaaacttct 780 
ttaaggacat gccccagcca cggcttgctc tcaactgtgt tggtgggaaa agctccacag 840 
agctgctgcg gcagttagcg cgtggaggaa ccatggtaac ctatgggggg atggccaagc 900 
agcccgtcgt agcctctgtg agcctgctca tttttaagga tctcaaactt cgaggctttt 960 
ggttgtccca gtggaagaag gatcacagtc cagaccagtt caaggagctg atcct;cacac 1020" 
tgtgcgatct catccgccga ggccagctca cagcccctgc ctgctcccag gtcccgctgc 1080 
aggactacca gtctgccttg gaagcctcca tgaagccctt catatcttca aagcagattc 1140 
tcaccatgtg atcatcccaa aagagctgga gtgacatggg aggggaggcg gatctgaggg 1200 
gctgggtgca ggcccctcag ttggggctcc caccttcccc agactactgt tctcctcact 1260 
gcctcttcct attaggagga tggtgaagcc agccacggtt ttccccaggg ccagccttaa 1320 
ggtatctaat aaagtctgaa ctctcccttc caaaaaaaaa aaa 13 63 



<210> 40 

<211> 1196 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 253 683 0CB1 

<400> 40 

ggttgccagg caaccgcgaa ccgcggtccc tgcctctgag tctctctcga ccatggccaa 60 
attcgtcatc gcgggtagag cagattgtcc atattatgct aaaacagaac ttgtggcaga 120 
ctatttacaa aagaatcttc ctgattttcg gatacataaa atcacacaac gtcctgaggt 180 
ttgggaggat tggctaaaag atgtgtgtga aaagaataag tggagtcaca agaattcccc 240 
tatcatctgg agagagctgt tggatcgtgg aggaaagggt ttgcttttgg gaggatataa 3 00 
tgagttcctg gagcatgctc agctttacta tgatgtcacc tctagcatga cgactgaact 360 
gatgatggta attgctcaag agaacctggg ggcacatata gaaaaagagc aggaggaaga 420 
agccctgaaa acttgcatca accccttgca ggtctggatc accagtgcct ctgctcctgc 480 
ctgctacaac ctaattccca tattgacgag tggcgaagtg tttgggatgc atacagaaat 540 
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tagcataact ctatttgaca acaagcaggc ggaagaacat ctcaaaagcc ttgtggtgga 600 
gacccaagac ctggcatctc ccgtcctgcg cagtgtctcc atctgcacga aggtggagga 660 
ggccttccgc caggcccacg tcattgtggt gctggatgac agcaccaaca aggaggtgtt 720 
cactctggag gactgcctcc gaagcagggt gcctctctgc aggctctatg ggtacctgat 780 
agagaaaaat gctcatgagt ctgtcagagt catcgtggga gggagaacct ttgtaaacct 840 
gaagacagtt ttactcatga gatatgcccc acgcattgca cacaacatta ttgctgtggc 900 
gctgggggtg gaaggtgaag cgaaagccat actggccaga aaactgaaga cagctccttc 960 
atgtgagtga atgtctcttt atgtttttgt ttttattttg aaacaacatg atttcaaagc 1020 
agctaggaag aacactgtcc cctctaggaa cattctgagt atctggaata ggaagtataa 1080 
aacagacata aagaacttat tgtgtgattt gactactgtg caaacatttt tttttactgg 1140 
cttttgtgtc tggctttttc catcattata aaaatcaata aatatttaat aagaaa 1196 



<210> 41 
<211> 1926 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2645179CB1 



*400> 41 

gcttcctctc ctggagtcca gagtcccgct 
gagtccggca gcatggaggc agcgcgccct 
ggcggcggca tcgcgggcgt cacttgtgcg 
gatattctct tggtaacagc ttctcctgtt 
tctaaaatat tggaagaatt cgatgttgaa 
tttcccaaca ttaaggttat agaatctggc 
attgtaacag aagatggcaa tcagcacgta 
aaaccaaagt tgatatgtga aggaaatcct 
gctcaggaat ttcagaaaca gcttactaaa 
ggtattgcac ttgagttagt gtatgaaatt 
gataaagcta tagggaatac tttcttcgat 
ctcattgctg aaaaatcaga ggctaaaatt 
ggaaggaaaa aggaagctag aagcaaatct 
ccagattggc atgaaggctt gaatcttaaa 
cttgaaacta tgtgtgaagt aaagaaaatc 
aaaaagtcct tcacttttcc aagagaccat 
cctgtctatg tggaattgac caatgaaaag 
acaggagtta caccaaatgt agaacctttt 
gatggtggcc tgaaagtgga tgatcatatg 
ggtgacatct gtactacatc ctggcagctg 
acccaggcta gacagatggg atggtatgca 
gactctattg acatggattt cagctttgaa 
tataaggttg tactgctggg aaaatacaat 
atgctgagat gtaccaaagg acgagaatac 
atgggagctg tcttaattgg tgaaaccgat 
aaccaaatga atctttcatc atatggagaa 
gattattttg actaaaaatg gaatttcttc 
gaaaaatcac aagtcaataa aatgaatgac 
attacagaag tgataatgat attagtggaa 
atcagttcaa agtttattta tagatatatc 
aaatcttaag ttatttattt ctgtgtttta 
ttggagcaaa tttaggtaag ttatctactt 
ttcttt 



gctccgccgc gatattcagt aaaccactgg 60 
cccccgacgg cagggaagtt cgtggtggtc 120 
gagcagttgg ctactcactt tccatcggaa 180 
attaaagcag ttacaaattt caagcagatt 240. 
gaacaatcaa gtaccatgtt aggaaaacgc 300 
gtaaagcaac tgaagagtga agaacactgc 360 
tataagaaac tctgtctgtg tgctggagct 420 
tatgtattag gaatccgtga tacagacagt 480 
gctaaaagaa taatgatcat agggaacggt 540 
gaaggctgtg aagtgatttg ggccattaaa 600 
gcaggagcag ctgaattctt gacttcaaag 660 
gcacataaaa gaaccagata tacaactgaa 720 
aaagcagata atgtaggaag tgcattggga 780 
ggaacaaaag agttttctca taagattcac 840 
taccttcagg atgagtttag aattttgaag 900 
aagtcagtta cagctgatac agagatgtgg 960 
atatatggct gcgatttcat tgtcagtgct 1020 
ctccatggta acagttttga tctaggagaa 1080 
cacacatccc ttcctgatat ctatgctgcc 1140 
agcccagtct ggcagcagat gaggctgtgg 1200 
gcaaagtgca tggctgcagc gagttcagga 1260 
ctgtttgctc atgtgacaaa attttttaac 1320 
gcacagggct taggttcaga tcatgaatta 1380 
atcaaagtcg tcatgcaaaa tggacgaatg 1440 
ttagaagaaa catttgaaaa cctaatctta 1500 
gatctgctag atccaaatat tgatatagaa 1560 
aggaatcata taaagttcca aatgacacca 1620 
tgtattgagt taatgatgac cacactgaaa 1680 
aaatataaaa acataaattc taagtttgaa 1740 
tttccaatac aacactgacc gcttagataa 1800 
aacataaata tgtttacttg tgatttagct 1860 
agccaaatgt actctagtag actagaacca 1920 

1926 



<210> 42 

<211> 1727 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2754425CB1 
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<400> 42 

agggaccgac ggcacttgat aatgtgacaa 
cttttgccgg aaagtggagc agtgggcacg 
tggcagtatc tgaggatcgg tggcagccat 
cccacgctaa tctgcatggt gtgggcgccc 
ttgcttcaca cgcccagtaa ggactccccc 
agctggtcgc tggatggcct gagggactgc 
tgcgacacca ccgctgtcat cactgtggcc 
tatcacgtgg gcagggagca gccccggccc 
gccgatgcca acgggctgca gaatggctac 
tgcacccaca acgagggcct caaccagaag 
cgctactcct ggtccggcat gggccgcatc 
ctcaacagcc ggccctccat ccagaagccc 
acgccctatt tctcccggga cgcacagaaa 
cagaccatcc tgtgtgagtt tgagaccctc 
caaggatgga aaatgaacag cacccccagc 
cagggggttt gtgttcccag gaactgtagg 
agccttcgga cctgtattgg gaacaatgtt 
cctgggactg tgataacgga gcactatgga 
ggtctgaaaa ctccaaatgg ctgtgagctg 
gaagggcgct gccttctctt tgatgactct 
gcagaggatg gcccacgggt ggttttcatg 
gccgaacggc aggctcttga tttcatcttt 
tggagtcggc gagaagggcc gaggcggggc 
accggtgttg tttccatgct cagaaacctg 
atatcatgtc gggtccctct ttcccttggt 
ttccttagat tttttttttt tccttccaat 
acagcgcatt cctttgattg gtccttgagt 
tcttctgttg ctacttgttt ttttcagtgc 
tctgaatata ataatgtaaa acttcttgtg 



aaaccagcct cttccacccc actgcagtga 60 
gccaaccttg tcgttcatca atgccgcccc 120 
gcctccccct gccccagccg ctccttcccc 180 
ttgggacccc cgaggactga ttgtctgacc 240 
aagatgtcgc tcgagtggct ggtggcctgg 3 00 
atcgccaccg gcatccagtc cgtgcgggac 360 
tgcctcctgg tcctcttcgt gtggtactgt 420 
tacgtctccg tcaactccct catgcaggct 480 
gtgtactgcc agtcccctga gtgcgtgcgc 540 
ctgtaccaca acctgcagga gtacgccaag 600 
cacaagggca tccgcgagca gggccggtac 660 
gaggtcttct tcctgcccga cctgcccacc 720 
catgatgtgg aagtgctgga acggaacttc 780 
tacaaagctt tctcaaactg cagcctcccg 840 
ggggagtggt tcacctttta cttggtcaat 900 
aagtgcccac ggacgtaccg cttgctcgga 960 
tttgggaacg cgtgcatctc tgtgctgagc 1020 
cccaccaaca tccgcatccg atgccattta 1080 
gtggtggggg gagagcccca gtgctgggca 1140 
ttcctgcatg ctgcgttcca tgaaggttca 1200 
gtggatttgt ggcatccaaa cgtcgcagcg 1260 
gctccgggac gatgagagta tttcccatgc 1320 
ctgggcagac tgtggtccgg tccagtccct 1380 
cctcagcgga aagctcttat ttgggatttt 1440 
tattgtaaat ggaaactttt cggcttgtat 1500 
catttgcttc agagactcct ttctggccta 1560 
gaccagagac ttagtgccct tgtaagtctg 1620 
tctgaaatag agtaactaaa tggttatttg 1680 
gtcatcttaa aaaaaaa 1727 



<210> 43 
<211> 611 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2821526CB1 

<400> 43 

cccatccggt gggttttgct cggcgcaccc 
ttagtgcagg tcctgaaacg cgggctgcag 
tatctacggg tttttttcag gacaaatgat 
aaatatggaa acaaatacta tgaagacaac 
gtatatacta ctgaaatgaa tggcaaaaac 
cctcctgaat ggcatcgttg gcttcacagt 
cttactgctc gtaaattcat ttggacgaac 
caatatgtac cttattctac cactagaaag 
ccttacaagt aaagacaatg aagaacagtt 
aattactctt ttactgttta ccattcacta 
aaaaaaaaaa a 



gggaggcggg gccagcgagg caagatggag 60 
cagatcaccg gccacggcgg tctccgaggc 120 
gcgaaggttg gtacattagt gggggaagac 180 
aagcaatttt ttggccgtca ccgatgggtt 240 
acattctggg atgtggatgg aagcatggtg 300 
atgactgatg atcctccaac aacaaaacca 360 
cataaattca acgtgactgg caccccagaa 420 
aagattcagg agtggatccc accttcaaca 480 
gaaacatgca aaatatggag cttttcatgt 540 
taattcacaa ttaaaattgt gtgactaaac 600 

611 



<210> 44 
<211> 1352 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 28764 94CB1 
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<400> 44 

gttttattga 

aatcaagtta 

ggcaaaacgc 

ggttggcctt 

agctgcaaga 

gccttggcta 

gtcagactca 

ttctcattgg 

aggaagcttt 

tcttcaaacc 

actctgggac 

tctattttgg 

ttactgagga 

agcctcacca 

ttggagtggt 

agcagtgaca 

ccaagctagc 

actcccacag 

cgcagtaacc 

gtacccagga 

tcattttcta 

gtgtattcat 

taattcacag 



caatacatgc 
tgatttatag 
ctaccgagac 
ctacctgcct 
agctgtgggg 
taaggaggag 
gcgcttagaa 
caaggaattt 
ccggttcacc 
ctttcaagcc 
atatcgctgt 
gttgagggtc 
tcagaagtta 
cccaaagtgg 
tggtggcgtg 
gcttcaagaa 
tttttagggg 
ccaaaagaat 
taacttcctt 
aactgcaagc 
ccctttatga 
gtagagggat 
tagctgtatg 



atcatatctt 
aggattcagc 
atgaaggttt 
ttggtggtga 
aaagttatca 
accgtctgtg 
tgtctgacca 
gagcttagct 
tggagacttg 
aactcccact 
gatgtgcagc 
cttcctccta 
atagatgagg 
aaaaagaagg 
ttggtgagga 
cttaacagcc 
agtgttccag 
gagtgggaga 
atgtcccatg 
attgcctgtg 
cttggatgcc 
gtattcagcc 
gagacctttt 



ttgactttga 
tggaatacct 
tagccactag 
ctacacctaa 
tcaatgccac 
aggtgggccc 
actggatctg 
gtctgagttc 
ctcgaggtgt 
ttgtgaagtt 
tggtaaaaaa 
acttggtgaa 
gattggaagt 
tggcgtcagc 
ttgtcctctg 
ttgctcctga 
ctgctggtag 
aatggagggg 
gatctcttcc 
ttcctgggaa 
tccccacctc 
tttttagtga 
tc 



aggatatctc 
tgtgggtgct 
ttttgtcctt 
aacactggcc 
aacctgtact 
tgatggagtg 
tgggatgctc 
agacatcttg 
catctccact 
taaatatgct 
cttgagactt 
tctgaatttc 
taatctggac 

cttgggaata 

tgcgctaagg 
agaactggct 
tggatcagct 
acaatctcct 
tgatcttccc 
gagttctaag 
catttcccct 
acattttttt 



atgtcaaagg 
ggctgagggt 
gggagcctgg 
atccctgaga 
gtcacctgtg 
agaaggaaat 
catttcacca 
gagtttggac 
gacgatgagg 
caggagtatg 
gtcaagaggc 
catcagtcac 
agctactcca 
ggaattgcca 

gggggcctgc 

gcccaggaag 
tagagggaac 
gggagctatg 
tgcccattgg 
aagcttgcat 
cttctgagct 
tcaataaaag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1352 



<210> 45 

<211> 1458 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 3403225CB1 



<400> 45 

ggacggagag 

tgagccagag 

ggaacgatgc 

gctccatcca 

ggcaaggtag 

ctcgctagcc 

gctgccagtg 

ctatccgaca 

ctccatattc 

ggctttgaaa 

cacccaggcg 

cggctcttct 

gccctggctg 

tgggtgtctc 

gagcttctag 

tccatgccat 

ccagctggtg 

acacctgaca 

cttctctaag 

ggctgggcat 

agaggctcag 

caactcttgg 

aacacacaaa 

tcccatttca 

ttagagaaaa 



gagcagagaa 
ctggggttga 
tggtcacctt 
tcaggaagtt 
tggtgatcac 
gaggagcccg 
aaatccgagt 
ccaaatctat 
tgatcaacaa 
cccacctggg 
tcgtccgctc 
ccccctttgt 
agggcctgga 
caagggcccg 
gaatccggtg 
aatgaacagg 
ctgcgaatcc 
ctcttgtgag 
acctggaaag 
gggggtggca 
ctgaggcaag 
ttgatctctt 
gatcctggct 
atcatgcaga 
taaagact 



gcagcagaag 
agctggagca 
gggactgctc 
ctttgctggt 
tggcgccaac 
agtctatatt 
ggatacaaag 
ccgagccttt 
tgcgggagta 
agtcaaccac 
tgagctggtc 
caagacggca 
gcccctgagt 
aaataacaaa 
ggagtagctg 
gaccaaggag 
tgcctgctct 
actggcttat 
tcagcaaccc 
gaagagcccg 
aagagcacca 
tctttttaaa 
aaccctggcc 
tggtttcttt 



cagccaagag 
gcagcaaaag 
acctccttct 
ggagtgtgta 
acgggcattg 
gcctgcagag 
aactcccagg 
gctgagggct 
atgatgtgtc 
ctgggcaccg 
cggcactcct 
cgggaggggg 
ggcaagtact 
ac age t gage 
gtggaagagc 
aaggccaacc 
gatcctcttg 
ggcatgagtt 
tctgggggca 
agaaattggg 
tcactgccta 
aatatttgee 
tatttagatt 
ttgtaaagag 



ctggagccag 
caacagcagc 
tctcgttcct 
gaacaaatgt 
gcaaggagac 
atgtactgaa 
tgctggtgcg 
ttctggcaga 
catattccaa 
gggtcaccac 
ccctgctctg 
cgcagaccag 
tcagtgactg 
gcctatggaa 
tgcagcttta 
ctaaaggatt 
acccttctgg 
gtggacacct 
gcaggactgg 
tcagttccct 
tttctagggg 
accaccctgg 
ccttcctctc 
ttccgtttgc 



accaggaacc 
tacagaagtt 
gtatatggta 
gcagcttcct 
ggecagagag 
gggggagtct 
gaaattggac 
ggaaaagcag 
gacagctgat 
etaegcagtg 
cctgctctgg 
cctgcactgc 
caagaggacc 
tgtcagctgt 
tcaggcccaa 
gtcctcttgg 
gaatgtttgc 
atagagtgtt 
gcagatccca 
catcagcacc 
ctatacactc 
agtctagacc 
acctggacct 
ctttcaattt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1458 



<210> 46 

<211> 1884 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_feature 

<223> Incyte ID No: 4163943CB1 

<400> 46 

ggactccaag cgccatggcc gctgccgccc 
tgcaacgcgc agcgtgccag tgcccaactc 
tttcaccttc tgggaaaaca acagattatg 
atggagcagc agttacaaag gaagtaggaa 
tgtgcttgat gacagacaag aacctctcca 
ccctagtgaa gaatggcatc ccctttacgg 
attcaagctt catggaagct attgagtttg 
ctgtcggtgg tggctctacc atggacacct 
ctcattctga tttcctagat tatgtcagtg 
tgcctcttaa gcctctgatt gcagtgccaa 
gggttgccat ttttgactat gaacacttga 
tcaaacccac actgggactg attgatcctc 
tcgccaacag tggctttgat gtgctttgcc 
accacctgcg gagcccctgc ccttcaaatc 
acccaatcag tgacatttgg gctatccacg 
gggctgtcag aaatcccgat gatcttgaag 
ttgctggcat cggctttgga aatgctggtg 
tttcaggttt agtgaagatg tataaagcaa 
cccatggcct ttctgtggtg ctcacgtccc 
ttccagagcg acacctggag atggcagaaa 
tccaagatgc agggctggtg ttggcagaca 
ttgatgatgg cctagcagct gttggttact 
gaacgctgcc ccaggaaagg gtcaccaagc 
tggctgctct gtttgaagct tcaatgaaac 
taccgctggc cattgtagtg ctgagagcaa 
atctttgtgc ataacttacc tgttaccagt 
aattccccaa agctcagagt ccagttcctt 
tgttagaccc ccaggctcga cttcaggggt 
tactctgcct ctgcttcatg tagcaaatga 
attatatttc ctatgcctag taattcactt 
tagcaccagt acatacagaa gcatggcaag 
aaagatttga aacaaaaaaa aaaa 



gagcccgggt cgcgtacttg ctgaggcaac 60 
attctcatac ttactcccaa gcccctggac 120 
cctttgagat ggctgtttca aatattagat 180 
tggacctaaa aaacatgggt gctaaaaatg 240 
agctccctcc tgtgcaagta gctatggatt 300 
tttatgataa tgtgagagtg gaaccaacgg 3 60 
cccaaaaggg agcttttgat gcctatgttg 420 
gtaaggctgc taatctgtat gcatccagcc 480 
cccccattgg caagggaaag cctgtgtctg 540 
ctacctcagg aaccgggagt gaaactactg 600 
aagtaaaaat tggcatcact tcgagagcca 660 
tgcacaccct ccacatgcct gcccgagtgg 720 
atgccctgga gtcatacacc accctgccct 780 
ccatcacacg gcctgcgtac cagggcagca 840 
cgctgcggat cgtggctaag tatctgaaga 900 
caaggtctca tatgcacttg gcaagtgctt 960 
ttcatctgtg ccatggaatg tcttacccaa 1020 
aggattacaa tgtggatcac ccactggtgc 1080 
cagcggtgtt cactttcacg gcccagatgt 1140 
tactgggagc cgacacccgc actgccagga 1200 
cgctccggaa attcttattc gatctggatg 1260 
ccaaagctga tatccccgca ctagtgaaag 1320 
ttgcaccccg tccccagtca gaagaggatc 1380 
tgtattaatt gtcattttaa ctgaaagaat 1440 
gagctgatct agctagggct ttgtcttttc 1500 
ataggtggga tatacattta tcttgcagga 1560 
ccataaaaca ggctggacaa atgaccacta 1620 
cagtgttcct gtcccaaacc ccacacagaa 1680 
gcaaaaactc agtatctatc aaaagtgtaa 1740 
catgtctaaa aatttatctg atagaaacac 1800 
gatgtttctg gcagcacttt tctaataata 1860 

1884 



<210> 47 

<211> 1400 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4293484CB1 

<400> 47 

ctccaggcca gtctgggcac cctgggatag 
tcccgccaaa cgccagcctc gtcaccgctc 
cagcaggtgg cagccagccc ctggatgagc 
actctgcaca ggcccagaag ctggtgtacc 
agcacgtggt gcgaatgctg ctgcagcggg 
accaacacct gggtccctgg ctggaggagc 
cttgtgtgca gaccggaaca cggttcctgg 
ctaacaccaa aggtcacccc ttctacaggg 
acaggcaccc ctatccttgc agcaaggccc 
ggaggaaggt ccgtgggggg ctgcccctgg 
acggtgaagg ccaccagatc atgagggact 
ggctcttccg ggccatcccg gcctctgtgg 
cctggatgca cgtgctggca gcccgggagc 
aggtatactt ctgctacgat ggatcaccct 
tcctgggccc ctgcggactg cggctggtgg 
tggtgttcct ggctgccctc aatgccctgc 
acgcacccct gctgaacccc tacacgctgg 
ccgacaaggc tcagcgccat ttcggctatg 



cggctgcagc tcccaggatc agctctgccc 60 
cagggcacct ccagcagtaa caggtggttg 120 
caaggtctct tccccagcca ggcatggccg 180 
tggtcacagg gggctgtggc ttcctgggag 240 
agccccggct cggggagctg cgggtctttg 300 
tgaagacagg tacccggaac gtgatcgagg 360 
tctacaccag cagcatggaa gttgtggggc 420 
gcaacgaaga caccccatac gaagcagtgc 480 
tggccgagtg gctggtcctg gaggccaacg 540 
tgacgtgtgc ccttcgtccc acgggcatct 600 
tctaccgcca gggcctgcgc ctgggaggtt 660 
agcatggccg ggtctatgtg ggcaatgttg 720 
tggagcagcg ggcaaccctg atgggcggcc 780 
acaggagcta cgaggatttc aacatggagt 840 
gcgcccgccc attgctgccc tactggctgc 900 
tgcagtggct gctgcggcca ctggtgctct 960 
ccgtggccaa caccaccttc accgtcagca 1020 
agcccctgtt ctcgtgggag gatagccgga 1080 
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cccgcaccat tctctgggta caggccgcta cgggttcagc ccagtgacgg tggggctggg 1140 

gcctggaggc ccagatacag cacatccacc caggtcccga gccctcacac cctggacggg 1200 

aagggacagc tgcattccag agcaggaggc agggctctgg ggccagaatg gctgtccttg 1260 

tcgtagagcc ctccacattt tctttttctt ttttgagaca gggtcttgct ctgtcaccca 1320 

gactggagtg cagtggtgtg atcatagctc actgcaccct caacctcctg ggttcaagca 1380 
atcctcctgc ctcagcctcc ~ 1400 



<210> 48 

<211> 1313 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4440080CB1 

<400> 48 

gaggctggct agggagccgt gggagtgagc 
tcataggagc tggctgctga atgtaagagt 
agatgtgacc tatcaaatga agaggacatc 
cacagcggtg tagacatctg catcaacaat 
tcaggcagca ccagtggttg gaaggacatg 
tgcacacggg aagcctacca gtccatgaag 
aacatcaata gcatgtctgg ccaccgagtg 
gccaccaagt atgccgtcac tgcgctgaca 
cagacccaca tccgagccac gtgcatctct 
aaactccacg acaaggaccc tgagaaggca 
aaacccgagg atgtggccga ggctgttatc 
attggagaca tccagatgag gcccacggag 
cttccctccc cacccttcat ggcttgcctc 
gatcacggga taccacttcc tgtccacacc 
tttttatatc atcttgtcaa attgcttcag 
gaggtggtgt ccctaattgt tttacttgtt 
ctttgtctgc tctcagtgtc ttccctttga 
atcttcttgc acctcaacgt ctgtggctca 
ttatatctgt gttgttatcc agggctccag 
tctccccctt atctatctcc ttctcggctc 
ctggggtcat ccctccactc tgactctgac 
gtggatttca tggtgatcat taaaaaagaa 



tgctcacccc agtcagaccc ctgttgggtt 60 
gcaggctacc ccgggacttt gatcccctac 120 
ctctccatgt tctcagctat ccgttctcag 180 
gctggcttgg cccggcctga caccctgctc 240 
ttcaatgtga acgtgctggc cctcagcatc 300 
gagcggaatg tggacgatgg gcacatcatt 360 
ttacccctgt ctgtgaccca cttctatagt 420 
gagggactga ggcaagagct tcgggaggcc 480 
ccaggtgtgg tggagacaca attcgccttc 540 
gctgccacct atgagcaaat gaagtgtctc 600 
tacgtcctca gcacccccgc acacatccag 660 
caggtgacct agtgactgtg ggagctcctc 720 
ctgcctctgg attttaggtg ttgatttctg 780 
ccgaccaggg gctagaaaat ttgtttgaga 840 
ttgtaaatgt gaaaaatggg ctggggaaag 900 
aacttgttct tgtgcccctg ggcacttggc 960 
catgggaaag gagttgtggc caaaatcccc 1020 
gggctggggt ggcagaggga ggccttcacc 1080 
acttcctcct ctgcctgccc cactgcaccc 1140 
cccagcccag tcttggcttc ttgtcccctc 1200 
tatggcagca gaacaccagg gcctggccca 1260 
aaatcgcaac caaaaaaaaa aaa ~ 1313' 



<210> 49 

<211> 1459 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5495687CB1 

<400> 49 

gggccgcgcc cgcgctccca ggctctcctc 
tcgcgtagcc gtgcgccgat tgcctctcgg 
gaccgccccg cgtcatgcgg ctcctcggct 
ttcccgtccg cggcgtggag gttgcagagg 
ctgctcaccc tctccaggtg ggggctgtgt 
cgatgggcca ggacagggca gcagaagagg 
gcgatcacat ggtgatgctg tctgtgattc 
agcctagcgg cgtcacctgt ggtgctggag 
agagcctttt ctctctggat ggcgctggag 
acacagagcc agaagtggcg gaatctgacg 
aaagtctgaa atccccaaag gtgaactgtg 
tcactctgaa aattttaaat atgtcacagg 
gtgactgtac tctagtcctg ttttacaccc 
ctcactttaa ctctctgccc cgggcatttc 
ctcagcacag cagcctttct accaggtttg 



ccccagcctt cctccggctg gcagcacgac 60 
cctgggcaat ggtcccggct gccggtcgac 120 
ggtggcaagt attgctgtgg gtgctgggac 180 
aaagtggtcg cttatggtca gaggagcagc 240 
acctgggtga ggaggagctc ctgcatgacc 3 00 
ccaatgcggt gctggggctg gacacccaag 360 
ctggggaagc tgaggacaaa gtgagttcag 420 
gagcggagga ctcaaggtgc aacgtccgag 480 
cacacttccc tgacagagaa gaggagtatt 540 
cagccccgac agaggactcc aataacactg 600 
aggagagaaa cattacagga ttagaaaatt 660 
accttatgga ttttctgaac ccaaacggta 720 
cgtggtgccg cttttctgcc agtttggccc 780 
cagctcttca ctttttggca ctggatgcat 840 
gcaccgtagc tgttcctaat attttattat 900 
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ttcaaggagc taaaccaatg gccagattta 
aaatcttcat ttttaatcag acaggtatag 
ccgaccaaat aggccctctt cccagcactt 
tttccttatt ctttttaatt agttttatta 
ggtggctaat tccaggacaa gagcaggaac 
ggaaagagga acttcaatcc ttcgtttcag 
ccagtgacgt gttgacttga aacttcaggc 
aatgtataaa aaaattataa actggtgttt 
atattcaata gatgcactat tcttgttttt 
gtaatccagt ttgaaatgg 



atcatacaga tcgaacactg gaaacactga 960 
aagccaagaa gaatgtggtg gtaactcaag 1020 
tgataaaaag tgtggactgg ttgcttgtat 1080 
tgtatgctac cattcgaact gagagtattc 1140 
atgtggagta gtgatggtct gaaagaagtt 1200 
aaattagtgc tacagtttca tacattttct 1260 
agattaaaag aatcatttgt tgaacaactg 1320 
taactagtat tgcaataagc aaatgcaaaa 1380 
actgcatgaa cgtaattcag tatttggaaa 1440 

1459 



<210> 50 

<211> 2101 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5527735CB1 

<400> 50 

cggctcgagg cggccgtcat ggcgtcgccc 
gatgacttca tcgggccgtc tcaggagtgc 
ggaagtggcg tggccaagat tcgcattgaa 
gacggcggga cccggaggct ggagaaggcc 
agcggctgca tcacctccgc agagaccgtg 
aagaaggttc tagatgctaa caagatggcg 
tcggtctcac cacagtctag agcatcgctg 
actgccagga aattaacctc attctttaaa 
gccttctcaa ggcacttcag cctcctggag 
ggacaggccg actgcagaca ggcactgccc 
tgctatgccg agaagactca cggcagcttc 
ccgcagcagg tcatgggctc cctggtcaag 
cctgacaaga tctaccacgt cacagtgatg 
agacccgact ttttcaacca ggagcaccag 
ggagaagttt tcaggttgct ggaggaagag 
cctctggaca gcctgtgcag cggtgcctct 
ggctcggggg gctacctgga gcatgtgttc 
catgtggctg aggttaccta caaacccctg 
gagaaggagg gccaggtgct gctgcacttc 
aacctggtgc agaggctcaa acgagggcgc 
tgcccctcag gctgcctgaa cggcgggggc 
gage tec tec agcacgtgga gagactgtac 
gcgcctgggg ttcaggagct gtacacacac 
ggtcgcttgc tgeatacgea gtaccacgcc 
cggtggtagg ggctgcagga ccaggactcc 
accacatgcc ccaagacccc agggcttccc 
tgggacccga gtaggagcta ggactageca 
gggtgcctct gggttcccac tggctctgcc 
aggttccctg aggtcccaga gcctgttccg 
cccttgctgc tgctggtcgg gcacccaagt 
gectgaggea gagcaagacg ggttctcacc 
tgtgcaaaag gtgggagaca gagctggacc 
gtggaccctg gcaggggggg ggtgccaggc 
ctgtggtccg gggcaageca agtacgaagc 
gccaggtccc caccaggcct gtctcccagc 

g 



ttcagcgggg cgctgcagct gaeggacctg 60 
ateaagectg tcaaagtgga aaaaagggcg 120 
gatgaeggga gctacttcca aattaaccaa 180 
aaggtctege taaacgactg cctggcgtgc 240 
cttatcaccc ageagageca cgaggagctg 3 00 
gcacccagtc agcagaggct ggttgtagtt 3 60 
getgeaeggt ttcagctgaa tcctacagat 420 
aaaatagggg tgcacttcgt cttcgacacc 480 
agecagegag agtttgtgcg gegattcega 540 
ctgctggcct ctgcctgccc aggctggatc 600 
atcctccccc acatcagcac cgcccggtcc 660 
gacttcttcg cccagcagca gcacttgacc 720 
ccctgctatg acaaaaagct ggaagcctcc 780 
acaegggatg tggactgtgt cctcacaaca 840 
ggcgtctccc tccccgacct ggaaccagcc 900 
gcagaggagc ccaccagcca teggggaggg 960 
cggcacgcgg cccgagagct ctttggaatc 1020 
aggaacaaag acttccagga ggtgacactg 1080- 
gcaatggcgt acggcttccg caacatccag 1140 
tgcccctacc actacgtgga ggtcatggcc 1200 
cagctccagg ccccagacag gcccagcaga 1260 
ggcatggtcc gggctgaggc gcccgaggac 1320 
tggctgeagg gcacggactc ggagtgtgca 1380 
gtggagaagg ccagcactgg cctgggcatc 1440 
caggaggecg tgtccatgtg tgacagcaga 1500 
ccaaaattct gagtgagctg cagggtgtgc 1560 
ggacccgcag ccgcctcgtc acctccagtt 1620 
caggtggggt ggggtggccc aggcagcaga 1680 
ttggccctgg gccgaggccc acaggtgctg 1740 
gcgtgagggg cttcagcctg teceggggtt 1800 
cctgacttct ggaggcttcc cttgaagctc 1860 
tgcaggggtg gtcccgccac aaccctgcgt 1920 
ccctggaaag caggggttac cgttacgagg 1980 
agcagccatc gegggctgea tcatccccca 2040 
gtttgtctaa taaacgcacc ectegtaaaa 2100 

2101 



<210> 51 

<211> 2440 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
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<400> 51 

gtgtccctgc ggcgctaaga aggggagact gaggctgagg ctggggaaca tcgggcagca 60 
tgagcggctg cgggctcttc ctgcgcacca cggctgcggc tcgtgcctgc cggggtctgg 120 
tggtctctac cgcgaaccgg cggctactgc gcaccagccc gcctgtacga gctttcgcca 180 
aagagctttt cctaggcaaa atcaagaaga aagaagtttt cccatttcca gaagttagcc 240 
aagatgaact taatgaaatc aatcagttct tgggacccgt ggaaaaattc ttcactgaag 300 
aggtggactc ccgaaaaatt gaccaggaag ggaaaatccc agatgaaact ttggagaaat 3 60 
tgaagagcct agggcttttt gggctgcaag tcccagaaga atatggtggc ctgggcttct 420 
ccaacaccat gtactcaaga ctaggggaga tcatcagcat ggatgggtcc atcactgtga 480 
ccctggcagc gcaccaggct attggcctca aggggatcat cttggctggc actgaggagc 540 
agaaagccaa atacttgcct aaactggcgt ccggggagca cattgcagcc ttctgcctca 600 
cggagccagc cagtgggagc gatgcagcct caatccggag cagagccaca ctaagtgaag 660 
acaagaagca ctacatcctc aatggctcca aggtctggat tactaatgga ggactggcca 720 
atatttttac tgtgtttgca aagactgagg tcgttgattc tgatggatca gtgaaagaca 780 
aaatcacagc attcatagta gaaagagact ttggtggagt cactaatggg aaacccgaag 840 
ataaattagg cattcggggc tccaacactt gtgaagtcca ttttgaaaac accaagatac 900 
ctgtggaaaa catccttgga gaggtcggag atgggtttaa ggtggccatg aacatcctca 960 
acagcggccg gttcagcatg ggcagcgtcg tggctgggct gctcaagaga ttgattgaaa 1020 
tgactgctga gtacgcctgc acaaggaaac agtttaacaa gaggctcagt gaatttggat 1080 
tgattcagga gaaatttgca ctgatggctc agaaggctta cgtcatggag agtatgacct 1140 
acctcacagc agggatgctg gaccaacctg gctttcccga ctgctccatc gaggcagcca 1200 
tggtgaaggt gttcagctcc gaggccgcct ggcagtgtgt gagtgaggcg ctgcagatcc 1260 
tcgggggctt gggctacaca agggactatc cgtacgagcg catactgcgt gacacccgca 1320 
tcctcctcat cttcgaggga accaatgaga ttctccggat gtacatcgcc ctgacgggtc 1380 
tgcagcatgc cggccgcatc ctgactacca ggatccatga gcttaaacag gccaaagtga 1440 
gcacagtcat ggataccgtt ggccggaggc ttcgggactc cctgggccga actgtggacc 1500 
tggggctgac aggcaaccat ggagttgtgc accctagtct tgcggacagt gccaacaagt 1560 
ttgaggagaa cacctactgc ttcggccgga ccgtggagac actgctgctc cgctttggca 1620 
agaccatcat ggaggagcag ctggtactga agcgggtggc caacatcctc atcaacctgt 1680 
atggcatgac ggccgtgctg tcgcgggcca gccgctccat ccgcattggg ctccgcaacc 1740 
acgaccacga ggttctcttg gccaacacct tctgcgtgga agcttacttg cagaatctct 1800 
tcagcctctc tcagctggac aagtatgctc cagaaaacct agatgagcag attaagaaag 1860 
tgtcccagca gatccttgag aagcgagcct atatctgtgc ccaccctctg gacaggacat 1920 
gctgaggcag gggacagtgt cccctgctac cgcccgcccc tacccatggc ccgttgctgg 1980 
atgactgtta ctcttttttc agaaggtgtt gggattatca caggttaagc cttttgttcc 2040 
ccgtctgcac ctgaagggtt gtcgcctggc ctgggagagc ctcttccagg ttttgacctg 2100. 
caggcagtgc tctctaacag gaccatcaca gcttctgaac tgagccggag agagagaatg 2160'? 
gaattgctga cccctggaac tggcgggtat tctggtcatt gaggagacac catagtggaa 2220 
actggggctt atgctgctgc ctccagggtg tgaggtgggt ggggacctgt gtcaggtgtg 2280 
gatagccatt tctgctcaac cacacattct ctaagaaaca gcttgaaagc tctgtctggg 2340 
tcattcattt aaactagaag cagaggcact taaaacatgt accaggaacc atttaacaaa 2400 
gaatataaaa tgtcacaatc tgtgtactgt taaaaaaaaa 2440 



<210> 52 

<211> 1072 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> raisc_feature 

<223> Incyte ID No: 5596281CB1 

<400> 52 

cggacggtgg gtccagacaa aggttgtctc 
acgccatcag ctcgctgctt aaaattaaac 
gaaagtcatc atcctgacgg ccgctgctca 
tgcaagagaa ggtgccaaag tcatagccac 
ggaaaagtac ccgggtattc aaactcgtgt 
tcagtttgcc agtgaagttg agagacttga 
tcatggaact gtcctggatt gtgaggagaa 
gcgcagcatg tacctgatga tcaaggcatt 
caatattatc aacatgtctt ctgtggcttc 
gtacagcaca accaaggcag ccgtgattgg 
ccagcagggc atcaggtgca actgtgtgtg 



aagtttgttg ctcaaaccga gttctggaga 60 
cacaggttcc attatgggtc gacttgatgg 120 
ggggattggc caagcagctg ccttagcttt 180 
agacattaat gagtccaaac ttcaggaact 240 
ccttgatgtc acaaagaaga aacaaattga 300 
tgttctcttt aatgttgctg gttttgtcca 360 
agactgggac ttctcgatga atctcaatgt 420 
ccttcctaaa atgcttgctc agaaatctgg 480 
cagcgtcaaa ggagttgtga acagatgtgt 540 
cctcacaaaa tctgtggctg cagatttcat 600 
cccaggaaca gttgatacgc catctctaca 660 
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agaaagaata caagccagag gaaatcctga agaggcacgg aatgatttcc tgaagagaca 720 
aaagacggga agattcgcaa ctgcagaaga aatagccatg ctctgcgtgt atttggcttc 780 
tgatgaatct gcttatgtaa ctggtaaccc tgtcatcatt gatggaggct ggagcttgtg 840 
attttaggat ctccatggtg ggaaggaagg caggcccttc ctatccacag tgaacctggt 900 
tacgaagaaa actcaccaat catctccttc ctgttaatca catgttaatg aaaataagct 960 
ctttttaatg atgtcactgt ttgcaagagt ctgattcttt aagtatatta atctctttgt 1020 
aatctcttct gaaatcattg taaagaaata aaaatattga actcaaaaaa aa 1072 



<210> 53 

<211> 1040 

<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5731013CB1 

<400> 53 

ccttcggctc gagcggcaac 
gcttaggctc cgcggcctcc 
cttggccagc gttctccaca 
cttcagcgtc agccgcgacg 
ggtgatcgac ttcgcccgac 
ctgcgtgccc agagtagtgg 
ctgcaagtcg gtcgaggaga 
ggacgtgatc cgcatccgca 
gcaccccttc accaacaagc 
tcctgcccca gcccaggtgc 
ctttggactg ttactccggt 
atggaaccca tcaatgggca 
gtttctttcc agtgcctgct 
agtggcatcc ctaaaacctg 
tttaaatcca aattgataat 
ctgaagaatc cctgatgtct 
tgggctaggc actttatatc 
ccaaggtcaa aaaaaaaaaa 



<210> 54 

<211> 1624 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 5731162CB1 

<400> 54 

ctcagcctcg cgagcagctg ggactacagg tgcccgccac cacgcccggc taattttttg 60 
tatttttagt agagacgggg tttcaccgtg ttagccagga tggtctcgat ttcctgacct 120 
cgtgatccgc gcgcctctgc ctccaaagtg ctgggattac aggcccgagc cgccgcaccc 180 
ggcctgcagg tgttgttctg accccagctc cactgtgcca gcccgacttg agatgccaag 240 
tatcttgggc ctgaggattg aggcctagag agaggcaggc gttcctcaga gtcacaggga 300 
atggcggcgc ctggactggg actcagccca gctgcttggc ctgaccctct cacagcataa 3 60 
tttcccggca cctggaggcc gcgcttagaa gccgcccagt gccctgagcg tctccatggc 420 
ctgcctgagc ccctcgcagc tccagaagtt ccaacaggat ggattcctgg tgctggaagg 480 
attcttgtct gcggaagagt gtgtggccat gcaacaaagg attggcgaga tagtggctga 540 
aatggatgtt cctctccact gccgcacaga attctccacc caggaagagg agcagcttcg 600 
agcccagggc agcacagact atttcttgag cagtggtgac aagattcgat tcttctttga 660 
gaaaggcgtt tttgatgaga aaggaaattt cctggtccct ccggagaaat ccatcaacaa 720 
aattggccac gctctgcacg cccacgaccc cgtcttcaag agcatcacac actccttcaa 780 
ggtgcagacc ttggccagaa gtctgggcct ccagatgccc gtggtggtgc agagcatgta 840 
catctttaag caacctcact ttggcggtga agtctcccct catcaggacg cctccttcct 900 
gtacacggag cccctgggcc gggtgctggg cgtgtggatc gcagtggagg atgccacgct 960 
ggagaacggc tgtctctggt tcatccctgg ctcccacacc agtggtgtgt caagaaggat 1020 
ggtccgggcc cctgttggct cagcgcctgg taccagcttc cttgggtcag agccagcccg 1080 



cggaagcctc gaggtttagt 
aagctgtagc tatgacggcg 
acggactggg tcgctatgtg 
gcgcctcgtc tcgcggcgcc 
ggaatccagg ggtcgtaata 
ccgaatacct taacggggct 
tctcgacgct ggtgcagaag 
agcccttcca caccgacaac 
cgaccacgtt ccgcgggcta 
aagcacagtg aagagttgcc 
aaaggtggtt cttccccttt 
agttgacaga ggttctgctt 
tctgggggca gtgaccttgt 
agatgaggaa gacttcaagg 
aatgatctca aaacacagtg 
tattggaaca accactgagc 
tgtgtgaata cagatttata 



cccgcccccc tctcctcgct 60 
cgcgggactc cgagccgctt 120 
cagcagctgc agcgtctgag 180 
agggagttcg tggagcggga 240 
tatgtaaact cgcgtccgtg 3 00 
gtgcgcgagg agagcatcca 3 60 
ctggccgacc agtcgggctt 420 
cctagcatcc agggccagtg 480 
cgcccccgag aggttcagga 540 
ccaccaactg cagccccagg 600 
gggattccaa gcccaggcaa 660 
gggataatga agagctgcct 720 
gaaccactca tttttatgca 780 
gttttacagg acccttgttt 840 
agaggtctga aggctggctt 900 
tacggagagc tctgctgtga 960 
aaacaggtta ataaacttat 1020 
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ggataacagc ctctttgtgc ccaccccagt 
agaagtggta cacaagagca agcagaacct 
ccacctcatg gaggcctctg gcaccacctg 
tgaactgccc tttccccaac tgtacaccta 
ctcccgggtg aagctgtggg ctgtaaacac 
agggaggtct tgtctcccct cctgggcttt 
gggtcagggg cttccctaag atcttcacct 
tgaggaggct tctcagccac caaagggttc 
atggaactct ggttattatg gtgttagtta 
aaaa 



gcagagaggg gccctggtcc tcatccatgg 1140 
ctctgaccgc tcgcgccagg cctacacttt 1200 
gagcccggag aactggctcc agccaacagc 1260 
aaggctctcg cagggcagga gccctcgccc 1320 
cagtgccttg ctcagcctcc tggttgcaac 1380 
cctcctgccc tgtgggcagc agcctaggct 1440 
ctctgcctcc ctactgcccc aacatagcct 1500 
tggccccttc tcactctcct ctcctctcag 1560 
tcgaataaaa acgacttcag aatgaaaaaa 1620 
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Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 



This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 

1. [x] Claims Nos.: 

because they relate to subject matter not required to be searched by this Authority, namely: 

Although claim 18 is directed to a method of treatment of the human/animal 
body, the search has been carried out and based on the alleged effects of the 
compound/composition. 

2. 0 Claims Nos.: 20, 21, 23, 24 

because they relate to parts of the Internationa! Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 

see FURTHER INFORMATION sheet PCT/ISA/210 



3. Claims Nos.: =* 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

see additional sheet 



1 " D &£te£*e daims diti ° nal **** app8cant < mis International Search Report covers ail 

2 - Q ^j^^^ cou,d be searched witho * effort justifying an additional fee, this Authority did not invite payment 



3 " D ^^^lnl^^^^? na{ SearCh f T Wer S tlmety paid me this International Search Report 

covers only those claims for which fees were paid, specifically claims Nos.: 



bJ ^Zf| u L r ^ ^^search fe « were timely paid by the applicant. Consequently, this International Search Report is 
restncted to the invention first mentioned in the claims; it is covered by claims Nos : 



1-28 partially 



Remark on Protest 



[ ] Tne additional search fees were accompanied by the applicant's protest 
| ^] No protest accompanied the payment of additional search fees. 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-28 (partially) 

isolated peptide comprising SEQ. ID. NO: 1, a sequence 
90% identical, an active fragment thereof, an immunogenic 
fragment thereof, an isolated polynucleotide encoding the 
polypeptide and corresponding to SEQ. 10. NO: 28, cells 
transformed with the polynucleotide linked to a promoter a 
transgenic animal comprising the polynucleotide linked to a 
promoter, an antibody recognizing the polynucleotide^ 
mothod of detecting the polynucleotide by PCr or 
hybridization, a composition containing the polypeptide 
methods for trating a disease by administering the 
composition, methods for screening agonists and antagonists 
of the polypeptide, compositions and methods linked to the 
alteration of the expression of the polypeptide, methods for 
determining the toxicity of the polypeptide. 

2. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 2 and 29 

3. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 3 and 30 

4. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 4 and 31 

5. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 5 and 32 

6. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 6 and 33 

7. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 7 and 34 

8. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 8 and 35 
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9. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 9 and 36 

10. Claims: 1-28 (partially) I 

Same as 1. with reference to SEQ. ID. NO: 10 and 37 j 

11. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 11 and 38 

12. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 12 and 39 

13. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 13 and 40 

14. Claims: 1-28 (partially) j 

Same as 1. with reference to SEQ. ID. NO: 14 and 41 

15. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 15 and 42 

16. Claims: 1-28 (partially) j 

Same as 1. with reference to SEQ. ID. NO: 16 and 43 

17. Claims: 1-28 (partially) j 

Same as 1. with reference to SEQ. ID. NO: 17 and 44 

18. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 18 and 45 

19. Claims: 1-28 (partially) 

Same as 1. with reference to SEQ. ID. NO: 19 and 46 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 

20. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 20 and 47 

21. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 20 and 48 

22. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 22 and 49 

23. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 23 and 50 

24. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 24 and 51 

25. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 25 and 52 

26. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 26 and 53 

27. Claims: 1-28 (partially) 
Same as 1. with reference to SEQ. ID. NO: 27 and 54 
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Continuation of Box 1.2 
Claims Nos.: 20, 21, 23, 24 



Present claims 20, 21, 23 and 24 relate to an extremely large number of 
possible compounds (agonists/antagonists). In fact, the claims contain so 
many options that a lack of clarity within the meaning of Articles PCT 

"SSIible 30 6Xtent 35 t0 render 3 TOa " in 9 ful se *™* of the claims 



The applicant s attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT) The anniirant 
is advised that the EPO policy when acting as an Intemtlml P 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 

™-n* e J f T? pect1v 5 0f whether or not tne c1ai ™ are amended fo lowing 
receipt of the search report or during any Chapter II procedure 
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